Preface


This book is a systematic description of the use of the scientific method 
in studying behavior. Although the study of behavior is as old as man, its 
systematic formulations have usually taken the form of pseudoscientific 
systems, each purporting to have the keys to the understanding of man’s 
problems. Relatively few attempts have been made to put forth in simple 
language a description of the scientific method, which has been utilized 
so successfully in other areas. The past hundred years have seen remark- 
able progress in evolving scientific procedures for studying behavior, but 
it is questionable whether these advances have appreciably reduced the 
extent to which pseudoscientific systems are accepted. In systematically 
describing how the scientific method can be applied to human responses, 
this book will help to counteract false approaches to the study of man’s 
behavior. 

For many years there has been a need in the field of psychology for 
a textbook on the scientific method. The present book gives the student 
a systematic description of the many concepts with which the scientific 
psychologist must deal. The aim of the book is to present the character- 
istics and concepts of the scientific method as a set of tools with which 
the student can embark on a serious attempt to learn about, and to ex- 
periment with, human behavior. It is common practice, early in his train- 
ing, to expose the student of psychology to the experimental literature. 
There is no question that a knowledge of this literature is essential, and 
that absorbing it is a continuous process in the life of the student. It is 
the experience of the authors, however, that many students begin their 
study of the literature before they have sufficient background to enable 
them to understand, evaluate, and absorb the important concepts they 
are expected to learn. The student will find the experimental literature in 
psychology more meaningful if he can approach it with a well-formulated 
set of tools, even though these tools are not forged to the precision that 
characterizes those used by the mature scientist. 

The field in psychology showing the most promise for future develop- 
ment is research in the problems of behavior. There are many forms of 
research, extending from simple methods of fact gathering to combined 
rational and empirical explorations of highly conceptualized theories 
about “human nature.” Psychology has reached a stage of maturity when 
it can discard, for the most part, mere collecting of information about 
human responses and can get down to the more profitable business of 
exploring theoretical frameworks applicable to the various kinds of 
human behavior. In recent years many investigators have shown that it 
is not only feasible but very profitable to examine scientifically even the 
most molar types of human response. 

In presenting a systematic account of the concepts widely used by 
scientific psychologists, the authors have not attempted to present all 
possible interpretations of any given concept. At first, the serious stu- 
dent of psychology can readily be overwhelmed by the wide variation 
with which scientific concepts are interpreted. It has been necessary at 
times to give very precise meanings to some of the concepts. This is 
done, not with the intent of persuading the student to adopt the inter- 
pretations suggested, but rather to avoid clouding the issues with varia- 
tions in interpretation that will prevent him from learning the nature of 
the principal arguments being advanced. At this stage in learning it is 
much more important to find the common thread that runs through all 
scientific psychology than to become confused about divergencies of 
meaning that have proved so difficult to resolve even by mature scien- 
tists. After having gained an understanding of the common methods and 
goals of scientific psychology, the student is better prepared to engage 
in polemic discussions on psychological theory. 

The book is written with the expectation that the student will ap- 
proach the study of scientific psychology as a scholar. It is, then, not a 
compilation of experimental studies, written in recipe style for easy mas- 
tery. The aims and methods of scientific psychology are treated as prin- 
ciples to be learned at a generalized level so they can be applied to any 
problem within any particular field. Concreteness is given to the princi- 
ples by means of illustrations drawn from many special areas of psycho- 
logical research. There has been an attempt to make the 
difficult throughout, without omitting any concepts that are important
for the beginning student in scientific psychology.

The nature of the content of the book and the particul 
presentation utilized make it adaptable to the needs and special interests 
of a wide variety of college courses. In mimeographed form it has served 
the authors in beginning courses in experimental psychology. It has been 
used in courses that have included a weekly 3-hour laboratory period 
and in courses consisting only of lectures or lecture-de 
With the omission of certain chapters, the mimeogr. 
been adapted to courses carrying only two units of e 
also served as supplementary 
courses and for graduate cours 

The book is divided into three parts. Part One explains certain general 
concepts of the scientific method and particularizes them in terms of the 
problems of psychology. Part Two discusses the principal steps of the 
method, beginning with the formulation of a problem that initiates a 
study and ending with the formulation of the generalizations that termi- 
nate it. In Part Three, many of the special procedures of scientific psy- 
chology are presented. Here the student comes to grips with some of 
the particular tools he will need to master if he intends to perform scien- 
tific research of any kind in the area of human behavior. 

Suggested readings are given at the end of each chapter. These are not 
included for the purpose of documenting the arguments of the text, but 
rather to assist the interested reader in initiating further exploration of 
those general areas that appeal to him. The number of selections is pur- 
posely kept small. The student will find of interest in these references 
many chapters that are not specifically cited. 

The authors are indebted to many of their colleagues and graduate 
students in psychology for advice and criticism. Particular thanks are 
expressed to Professors D. G. Ellson, L. J. Postman, and T. R. Sarbin. 
The following graduate students, who over the years assisted in the 
teaching of demonstration and laboratory sections in which earlier ver- 
sions of the text materials were used, made many valuable sugges- 
tions: Richard Barthol, Richard Christie, Samuel Fillenbaum, Theodore 
Kroeber, Frank Meeker, Donavon Morrison, Herbert Naboisek, Harvey 
Peskin, Carlos Quadra, Benjamin Rosenberg, Harold Sampson, William 
Sickles, George Stone, Frank Vanasek, and Everett Wyers. 

The authors wish to express their appreciation for permission to repro- 
duce the following materials: quotation in Chap. 4 from A. A. Brill, “The 
Basic Writings of Sigmund Freud,” Modern Library, Inc., and George 
Allen & Unwin, Ltd.; scaled statements in Chap. 12 from “A Scale for 
Measuring Attitude toward War,” by D. D. Droba, The University of 
Chicago Press; Fig. 12, Chap. 15, from Report No. 1, The Aviation Psychology 
Program in the Army Air Forces; material for Tables 6 and 7, 
Chap. 15, from Treatment of Schizophrenia by J. S. Gottlieb and Paul 
E. Huston, Archives of Neurology and Psychiatry; Fig. 13, Chap. 15, 
from Overcoming Resistance to Change, by L. Coch and J. R. P. French, 
Human Relations; Table 8, Chap. 15, from Assessment of Persons 
through a Quasi Group-interaction Technique, by R. S. Crutchfield, 
Journal of Abnormal and Social Psychology. 

CLARENCE W. BROWN 
EDWIN E. GHISELLI 

PART ONE 


Some General Concepts about the Scientific Method 


Every discipline has an underlying rational basis, and in this respect 
science does not differ from other disciplines. There are certain funda- 
mental philosophical notions about natural phenomena that we should 
examine early in our study of the scientific method. In addition, there are 
several basic problems that must be dealt with by every scientist regard- 
less of what type of subject matter he chooses to study. These problems 
should also be surveyed early, and consideration given to how they are 
solved in the field of psychology. 

Chapter 1 briefly reviews the ways in which the word science is used. 
The point of view is developed that science is primarily a method and as 
such it has applications to all kinds of natural problems. Successful exe- 
cution of the method, of course, demands a scientist who must be willing 
to learn its intricacies and abide by its rules. Being human, the scientist 
is subject to errors, and therefore scientific findings can never be inter- 
preted as infallible. 

In dealing with natural phenomena, the scientist must take certain 
things for granted. These presuppositions are described in Chap. 2. They 
concern in part the nature of the physical universe with which the 
scientist deals and in part the nature of the psychological processes that 
are involved in his role as scientist. Knowledge of how he contributes 
as part of the scientific method enables the scientist to minimize bias 
and achieve a high degree of objectivity. 

The aims and methods of science are discussed in Chap. 3. The most 
general aim of science is understanding, but there are several more 
specific aims that we achieve on the way to understanding. As a method, 
science can be described at such a general level that it encompasses all 
subject-matter disciplines. When applied to particular problems, how- 
ever, the method becomes a very large number of specific procedures. 

The concept of cause and effect has been central to man’s thinking 
both as a scientist and as a nonscientist. It is a particular way of describ- 
ing relationships among natural events. In Chap. 4 we examine this 
concept in a pragmatic way to determine what we actually are able to 
observe in a relationship that we refer to as causal. For the purpose of 
the scientist, the concept of functional relationship seems sufficient and 
squares with the facts. 

Two very fundamental problems faced by every scientist are the prob- 
lem of controlling the variables relevant to his project and the problem 
of quantitatively describing these variables. Control of variables is dis- 
cussed in Chap. 5. The first psychologists followed closely the lead of 
the natural scientists in attempting to control variables by procedures of 
physical manipulation. They soon learned, however, that differences be- 
tween phenomenal changes in human beings and phenomenal changes in 
inanimate physical objects were so great as to require procedures espe- 
cially designed for the variables operating in human behavior. Much of 
the scientific psychologist’s time is spent in the development of pro- 
cedures for achieving the degree of control needed by his particular kinds 
of hypotheses. 

Describing changes in behavior is a core procedure in all science. As 
explained in Chap. 6, the highest possible accuracy is obtained in quanti- 
tative description. The scientist has continually appealed to the number 
meanings of mathematics as a means of improving his descriptions. The 
application of these number meanings to human behavior is subject to 
many errors. It is necessary for us to realize fully the increased precision 
that mathematical and statistical procedures afford the psychologist, but 
at the same time we must be aware of the fundamental assumptions 
underlying these procedures. Only when the behavior data to be analyzed 
meet the assumptions underlying the formulas or equations we desire 
to use are we justified in availing ourselves of these procedures. 


CHAPTER 1 


Some Characteristics of Science 


Without expending any great effort we can collect many divergent state- 
ments on what constitutes science. These statements will vary so widely 
in their emphases that an aspect held important by one authority will not 
be included in the definition of science as given by another authority. 
If we are to study scientific method in psychology, it is essential for us 
to make a beginning toward defining that method. The following sections 
are directed toward establishing, in a preliminary way, a concept of 
science. 


SOME CURRENT INTERPRETATIONS OF THE TERM SCIENCE 


As is true of most words, the word science has come to have a variety 
of meanings. Let us briefly review some of the more frequently used in- 
terpretations that have stemmed from notions about the scientist and 
his work. 

Science and Subject-matter Fields. The word science is often used in a 
generic sense to refer to the so-called “sciences” or fields of scientific sub- 
ject matter, such as physics, geology, astronomy, and the like. In high 
school this meaning is used in designating a certain class of courses, the 
so-called “science courses,” as contrasted with other kinds of courses, 
such as those in history, civics, and economics, which are called the 
humanities, or courses in literature, drama, and music, which are called 
the arts. Science is, then, conceived as a term to be applied to only the 
“hard” things of nature and is not to include the less tangible phenomena 
of human behavior. This interpretation of science is widely held by the 
general populace. 

Science and Complicated Gadgets. The word science is also applied to 
activities in which such intricate instruments as microscopes, electric 
meters, complexly arranged glass tubes, etc., are used. Such gadgets are 
considered too difficult for most people to understand and use. Presumably, 
science is reserved for the select few who have the inclination and 
ability to deal with such complex things. 

Science and Universal Laws. Sometimes the word is used to refer to 
theories and laws that have been evolved to explain natural phenomena. 
Almost everyone has heard of the law of gravity and the law of relativity, 
and somehow science comes to stand for these and other principles and 
theories by which the scientist explains the workings of the natural 
universe. Thus, to many, science means abstruse descriptions of how the 
universe operates. 

Science and Systematic Procedures. A further use of the term science 
associates it with what is called the scientific point of view. This is given 
a variety of interpretations, sometimes referring to the making of a pur- 
poseful search, sometimes to the trusting of only facts, sometimes to the 
use of prolonged deliberation, sometimes to the application of involved 
mathematics, or to the use of some other supposed unique characteristic. 
There is no unanimity as to what is meant by being scientific about a 
problem except the idea that the scientific approach is somehow superior 
to any other approach. 

Science as Technical Methodology. Lastly, science is looked upon as 
comprising technical methods for solving problems. This is one of the 
most widely held meanings. Presumably, years of training are required 
to master the techniques. There are some who think that these scientific 
techniques are applicable only to the physical or material elements of 
nature. 

Pseudoscientific Schemes. Mention should be made here of the falla- 
cious application of the term in connection with certain pseudoscientific 
schemes devised for understanding and predicting human behavior. 
Astrology, phrenology, physiognomy, chirognomy, chiromancy, palmistry, 
chemotypology, graphology, and other kindred systems are presented as 
scientifically derived methods for explaining and controlling human be- 
havior. It is not necessary here to evaluate these pseudoscientific ap- 
proaches. Rather, it is more important that the reader do this for himself 
after he has gained an understanding of the scientific method. It is neces- 
sary to state, however, that proponents of each approach appeal to facts 
to gain support for their point of view. Furthermore, it must be affirmed 
that in some instances these facts are as well supported as the facts in use 
by the scientist. 

The statement that the above-named approaches to the study of behavior 
are pseudosciences results from the authors’ judgments about each 
one as a total system. No claim is being made that all parts of every 
system are pure fancy. Neither is it held that those parts that are supported 
by facts should be denied because a scheme as a whole is declared 
unscientific. 


SCIENCE AS BOTH GENERAL AND SPECIFIC METHODS 


The intensive cultivation of science in the narrow fields of physical 
subject matter has resulted in the development of a seriously retarding 
influence, viz., it has tended to encourage the idea that there is only one 
method of science and that this method is that of the physical sciences. 
Actually, as method, science can be interpreted either as a single general 
method or as many specific methods. It is fallacious to restrict the term 
to the procedures used in the study of physical phenomena. 

Science as a General Method. When we study the wide variety of situ- 
ations considered scientific, examining their similarities and their differ- 
ences, the one feature that is most common, that stands out most promi- 
nently, and that seems to form the essence of science is its general 
method. Starting, then, with this as a premise, we can ask the question: 
Is there one general method of science? The answer is that on a highly 
conceptual level science may be considered a general method. When 
scientists study specific problems, however, this general method is modi- 
fied in numerous ways, and many of these adaptations are of sufficient 
importance and sufficiently general in nature to be considered methods 
within themselves. Science, then, is a very general method, modified in 
various ways into many less general methods that are utilized in the 
study of specific problems. 

Science as a Multiplicity of Methods. The scientific method has been 
varied in a tremendous number of ways. No two problems are exactly 
alike, and no one method can be applied in an invariable manner to 
different types of problems. Every investigator has had to adapt existing 
methods to the specific conditions of his projects. One of the frequent 
mistakes of the young scientist is his failure to recognize the need for 
modifying available methods to suit his particular problem, with the result 
that the methods he uses are sometimes inappropriate. He later finds that 
such carelessness either produces totally invalid results or greatly reduces 
the precision of his interpretations and generalizations. 

Factors Underlying the Modification of the Method. It is not necessary 
here to consider specific modifications of the scientific method. Many 
illustrations will be found throughout the text. Mention should be made, 
however, of several factors that underlie modifications in the method. 
Three of these are as follows: the nature of the subject matter, the nature 
of the specific problem, and the stage of the inquiry. 

Variation in method due to difference in subject matter may be illus- 
trated by contrasting physiological psychology and social psychology. 
The study of brain mechanisms in learning demands histological and ex- 
perimental methods; the determination of racial prejudice in minority 
groups demands survey and population sampling methods. 

The nature of the problem very definitely conditions the method to be 
used. Inquiry starts, not from a method, but rather from a problem, and 
thus from the beginning the problem will condition the steps taken to 
solve it. Different types of problems raise different types of questions 
and require different kinds of procedures to achieve the answers. Let us 
consider two problems concerned with the relationship between ability 
in arithmetic and success in clerical work. One problem involves voca- 
tional selection, the other vocational guidance. The first problem requires 
information about the relationship between arithmetic ability and clerical 
success in order to determine if an arithmetic test should be used as a 
device for selecting clerks. The second requires similar information in 
order that vocational advice can be given to a graduating high school stu- 
dent who thinks he would like to become a clerk in a banking firm. In 
the first problem we would need to learn if applicant clerks scoring high 
on an arithmetic test will manifest superior performance when hired and 
trained as clerks. In the second problem we would need to know how 
performance on an arithmetic test, when combined with other informa- 
tion known about a graduating high school student, assists in prognosti- 
cating his success in the clerical profession. Even though the information 
needed appears to be similar, the problems are radically different, and 
different methods would be required to solve them. 
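
The first (selection) problem can be given a rough numerical form. The sketch below, in Python, is only an illustration under stated assumptions: the scores and ratings for eight clerks are invented, and the function name `pearson_r` is hypothetical rather than taken from any study cited in this text. It summarizes the relationship between arithmetic test scores and later clerical performance by a product-moment correlation coefficient.

```python
from math import sqrt


def pearson_r(x, y):
    """Product-moment correlation between two equal-length sequences."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two sequences of equal length (at least 2)")
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Sum of cross-products of deviations from the two means.
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    # Sums of squared deviations for each variable.
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined when a variable is constant")
    return cov / sqrt(var_x * var_y)


# Hypothetical data (assumed for illustration only): arithmetic test
# scores at hiring and supervisors' ratings of clerical performance
# after training, for the same eight clerks in the same order.
arithmetic_scores = [12, 15, 19, 22, 25, 28, 31, 34]
clerical_ratings = [3, 4, 4, 5, 6, 6, 8, 7]

r = pearson_r(arithmetic_scores, clerical_ratings)
print(f"r = {r:.2f}")
```

A coefficient of this kind bears only on the selection problem. The guidance problem, in which the test score is combined with other information about one student, would call for a different procedure and is not covered by this sketch.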

Variation in methodology associated with the stage of the inquiry 
may be exemplified in the use of logic, experiment, and statistics as indi- 
vidual procedures within a scientific study. Logic enters the picture early 
during the formulation of the inquiry; experiment enters later when 
empirical steps are being designed to collect the data; statistical methods 
enter at this stage and also at a later period when the data are being 
analyzed; and logic again enters when the implications of the findings are 
being evolved and formulated as generalizations. 

workaday world. Facts are compelling aspects of experience to which 
everyone must adjust. 

To the nonscientific person, there seems to be inherent in his concept 
of fact “a given” or “a finality” characteristic that brooks no opposition. 
“A fact is a fact and that is all there is to it.” This is not the interpreta- 
tion of the scientist, who, more than any other person, has the task of 
discovering and identifying facts for all of us. Facts are not given but 
are discovered through inquiry. Facts do not possess the characteristic of 
finality but are continually undergoing change as inquiry proceeds. 

The essential task of the scientist is to identify facts with the highest 
possible precision, for they form the stock in trade of all of his work. He 
discovers facts, he describes facts, he relates facts, he makes judgments 
about facts, he explains facts, and he generalizes from his facts. The 
center of focus of all of his activities, whether they are concerned with 
empirical search or with rational manipulation, is facts. 

Facts Defined. A general definition of the term fact is as follows: an 
experience, event, change, or occurrence for which there is substantial 
evidence. As with most definitions, there is need of further elaboration. 
We should keep in mind that the word fact is a generalized concept and 
therefore may be assigned more than one meaning. One way of interpret- 
ing fact is to conceive it to be a continuum of experience, from experience 
that is immediate to experience that is highly conceptual. The one essen- 
tial characteristic of all facts is the substantiation of their existence 
through evidence. In the following discussion, three levels of fact are 
outlined. It is to be understood that these levels are chosen arbitrarily 
to highlight three points along the continuum of fact. 

Facts of Immediate Experience. One kind of immediate experience is 
called “raw” or “brute” experience. It refers to experiences that are “un- 
cluttered” even by names. Such would be the experiences of a young 
infant. These can be called the most “factual of facts” because they have 
not been subjected to change through the individual’s thinking about 
them. 

It is questionable whether a person is capable of having raw or brute 
experience once he has learned to assign names to his experiences. The 
assignment of a name to an experience is the first step in conceptualiza- 
tion. The individual’s intellectual processes are brought into play imme- 
diately upon the presence of a stimulus, to identify, interpret, and assign 
meanings to the new experience. Thus his experiences have a conceptual 
component and are no longer the immediate “naked” experiences that a 
young infant might have. 

Facts Describing Immediate Experience. These facts are abstract and 
conceptual in nature, and describe and interpret immediate experience. 
They involve the combining of several meanings into a composite. Meanings 
directly aroused by sensory stimuli are combined with meanings 
dependent upon the arousal of previous experience. Regardless of the 
complexity of the psychological genesis of this composite type of ex- 
perience, it is readily considered a fact. This kind of fact is exemplified 
by such words as house, tree, rock, table, book, muscle, head, etc. 

The least conceptualized experience is primarily sensory in nature. 
Some examples are touches, smells, sounds, and pains. More conceptual- 
ized in nature are the thought or rational experiences exemplified by 
ideas, memories, and images. Sensory facts may be referred to either 
external or internal sources; rational experiences are referred only to 
internal conditions. This distinction between external and internal is 
made because of the difference in ease with which agreement can be 
attained about facts of these two types. Suppose we present a small box 
to two individuals, allowing each one to see and lift it. We then ask 
each to describe his visual and his kinesthetic experiences of the box. 
We shall find much more agreement in the descriptions of the visual box 
than in the descriptions of the kinesthetic box. 

Conceptual experiences are one of man’s greatest assets because they 
are the only ones that he can mentally manipulate. They enable him to 
perform three functions, namely, to represent the past in the present, to 
react in the present with implicit or covert responses, and to carry the 
past and the present into the future in the form of foresight and plan- 
ning. An illustration of the first function should prove sufficient. The 
simple recall of a sports event will be used. We can recall and think 
about yesterday’s baseball game by manipulating the names of the 
players, the numbers of the innings, the words that depict the plays that 
were made, the decisions of the umpires, etc. We reconstruct the game by 
the use of conceptual facts. 

Facts describing immediate experience are not highly conceptualized. 
Counterparts in sensory experience are usually readily found, and the 
conceptualization is primarily associated with the recall of past meanings. 

Facts Remote from Sensory Experience. It is obvious that meanings 
may transcend sensory experience. Some meanings are never displayed 
to view in any sensory manner. They never exist in a form in which they 
are open to sensory inspection. These meanings are evolved primarily 
through the use of reason. Despite the fact that they are highly conceptual 
in nature, when they are supported by sufficient evidence, they are 
accepted as fact. 

One way of arriving at a fact of this type is through generalization. 
An example is represented in the following statement: All sharp objects 
strongly pressed against the skin will cause pain. It is not possible to 
have a sensory experience of such a general proposition. By reasoning, 
we form the general proposition from experiences with sharp objects. 
When supported with sufficient evidence, 
such a generalized proposition is considered a fact. 

Another example of this type of fact is the relationship between two 
concepts. Such a relationship is not observable in sensory experience, but 
there may be sufficient evidence of its existence to compel its acceptance 
as a fact. That arithmetic ability is positively related to spelling ability 
is accepted as a fact. Both abilities are concepts and are never actually 
experienced. They are inferred from the concrete performance of indi- 
viduals in solving problems and in spelling words. Likewise, the rela- 
tionship between the abilities is experienced only on a conceptual level. 
However, there is a sufficient amount of evidence traceable to various 
forms of sensory experience to justify accepting as fact the conclusion 
that a positive relationship exists between the two abilities. 


THE SCIENTIST’S ROLE IN SCIENCE 


Scientific generalizations are the joint product of two factors, namely, 
the scientific method and the scientist. The scientific method is the prod- 
uct of man’s ingenuity, and its successful application depends upon a 
continuous exercise of this ingenuity. Sometimes the point of view is 
expressed that science is too difficult ever to find widespread use. This 
proposition requires further examination. 

The Concept of the Scientist. Peculiar notions have arisen concerning 
the traits and characteristics possessed by the scientist. Popular accounts 
of scientific discoveries may conceive the scientist to be an elderly 
gentleman with bushy gray hair, peering from behind horn-rimmed 
spectacles. Or they may conceive him to have a goatee and always be 
wearing a white laboratory coat. These conceptions about the scientist 
extend into his working space and are concerned with certain “gear and 
trappings” that command awe and respect from the uninitiated. Some- 
times the necessary accoutrements consist of a laboratory room with 
benches on which there are long, twisted tubes of glass, gas burners, 
beakers, and bottles of mysterious chemicals. Sometimes they consist of 
motors, gears, wires, and meters accompanied by characteristic whirrings, 
grindings, and crackles. Sometimes, but not very frequently, they may 
include an ordinary room with a desk on which there are sheets of printed 
materials. 

Regardless of what the popular conception of the scientist is or what 
kind of trappings are thought required for his “trade,” there is a prone- 
ness to accept him uncritically as a peculiar individual engaged in a 
mysterious calling that has to do with analyzing the nature of things, 
such as stars, rocks, chemicals, forces, and the like. Although in very 
recent years there has been an increased understanding of the importance 
of the contribution of scientists, seldom have questions been raised concerning 
the use of the scientific method as a possible and workable 
method for attacking the everyday problems of the “ordinary” man. 

A definition of a scientist that is readily understood can be stated as 
follows: He who rigorously applies the scientific method is a scientist. 
It follows that any individual can be scientific by rigorously applying the 
scientific method. This interpretation is not difficult to understand be- 
cause it stems directly from the most widely accepted definition of 
science. Furthermore, there is nothing new about this conception, for it 
has been championed over the years by those individu 
to speak, namely, the scientists. Despite its acce 
however, this conception runs counter to most o 
stition which beclouds current popular understanding of science. 

If this conception is correctly interpreted, it means that an individual 
can be scientific without the characteristics of bushy hair, horn-rimmed 
spectacles, goatee, or white laboratory coat. It means that scientific re- 
sults can be achieved in the absence of a room full of test tubes, gas 
flames, whirring motors, and crackling electric sparks. A further inter- 
pretation of the conception means that scientific results are not limited 
to a study of stars, chemical elements, cosmic rays, and the like, but that 
a scientific attack can be directed on problems of a more mundane na- 
ture, such as the frustrations and failures, the confusions and conflicts, 
and the prejudices and perversities of ordinary people. 


The Nature of the Scientist’s Contribution. Dame Nature is somewhat 
reluctant to reveal her secrets and does so only when prodded. The 
scientist acts as a probe; his task is to prod nature into displaying her 
workings. His search, then, must be an active one. He is not a mere 
passive recorder, registering successions of sensory impressions as events 
occur before him, but he busies himself in devising [...] considered. 

[...] to be investigated is a product of his interest, knowledge, 
training, motivation, and other similar personal characteristics. As 
a method, science provides a means for studying his hypothesis. Some 
kind of investigation, such as a su[...]ment, is organized through which 
empirical evidence pertinent to the hypothesis can be collected. 

In the conduct of the investigation, regardless of its nature, the pri- 
mary method that will be followed will be observation. Obviously, the 
scientist must be the observer. 

With the data collected, the next task is that of analysis. There are no 
analytical procedures available that can be applied automatically. The 
scientist must not only carefully select the appropriate procedures, but 
it is his responsibility to see that those selected are executed with high 
fidelity. 

Following the discovery of all the facts there comes the task of inter- 
pretation. This is a stage of high subjectivity, in which the pitfalls are 
both numerous and subtle. To do the job adequately the investigator 
must utilize his scientific acumen to the utmost in order to discover all 
of the hidden meanings and still adhere steadfastly to the facts. Seldom 
will he complete his study without discovering some “loose ends” that 
affect his generalizations and that escaped notice despite his attempt to 
control rigorously all of the pertinent variables. Never will he reach 
conclusions that will be wholly free of ambiguity. 


SCIENTIST AND NONSCIENTIST COMPARED 


In spite of the aura of magic and mystery with which the layman 
surrounds the scientist, it is actually found that in many respects the 
scientist’s labors are not very different in kind from those characterizing 
a nonscientific person reacting in the ordinary situations of everyday 
living. 

The Approach of the Scientist. The stimulus that goads the scientist 
into action is a problem. Some question is raised for which there is no 
immediate answer. He is provoked into thinking about it and into pon- 
dering about possible answers to it. As he mulls over the problem, the 
question at stake becomes clarified, and he verbalizes it as accurately as 
he can. Often this verbalization is formed into a hypothesis that can be 
empirically studied. He then devises situations from which he can collect 
facts that will be pertinent to the problem and from which he might be 
able to construct a solution. He is persistent in this search for facts and 
continues until he can establish a good case either for or against the 
hypothesis. He is careful, however, not to exclude any facts because of 
personal bias, but allows every fact to contribute its share to the total 
picture. Thus, he establishes a sound factual basis from which he can 
devise generalizations about the problem that initiated the study. 

The Approach of the Nonscientist. The various steps outlined above, at 
least in a very general way, find their counterparts in the behavior of the 
nonscientific person. To begin with, he encounters a wide variety of 
problems, and a large proportion of them merit the sound and careful 
attack used by the scientist. Likewise, the nonscientific person frequently 
thinks about his problems, but often in an ineffective way that can best 
be described as “stewing” or “fretting.” He is usually not so successful 
as the scientist in reaching the kernel of the problem, in stating it in the 
form of an answerable question, or in devising a workable hypothesis. 
His failure, however, is not due to the impossibility of applying a more 
rigorous analysis but rather to his ignorance or lack of experience in 
using a more rigorous method. 

Failure to state the problem accurately markedly handicaps his sub- 
sequent attempts to devise an answer. He frequently will seek out facts 
that are not pertinent to the problem. Here again his ineffectiveness is 
often due to lack of discipline and training. He may not continue his 
search long enough, and he may be content to formulate generalizations 
prematurely, before sufficient facts have been collected to justify them. 

Again, in his search he may allow some bias to influence his selection 
of the facts, and this will result in an invalid base on which to establish 
his generalizations. Frequently this biased selection of the facts is not 
known to the individual, and in many cases, if it were known, he would 
take steps to counteract it. 

Having made a generalization, the nonscientific person is less willing 
subsequently to check it as further facts become known. He persists in 
arguing for his generalization, either because he has strong feelings con- 
cerning the certainty of it, or because he is fearful of the changes he 
would have to face were he to give it up. 

A Comparison in Terms of Individual Flexibility. One of the chief 
characteristics of the scientist is his flexibility. His purpose is to improve 
his beliefs rather than to defend them. He is suspicious of his 
generalizations. He is forever questioning their validity and deliberately seeking 
further facts in order to test them. His creed calls for a continuous 
revision of all phases of his work as additional knowledge is accumulated, 
whether method or result, whether hypothesis or generalization. Of all 
people he is the most expert in changing his mind; that is, he is 
continually vigilant to bring his findings up to date in terms of trustworthy 
evidence. 


The attitude of the nonscientific person is in sharp contrast with this. 
As he grows up, he forms habitual ways of responding and becomes 
accustomed to rely on them most of the time. He is schooled to take for 
granted whatever is familiar, traditional, or customary. He stabilizes his 
behavior in an attempt to resist change, and comes to accept a certain 
amount of frustration and failure as inevitable. He enters the school of 
hard knocks and there builds up tolerance for various forms of social 
and personal dislocation. A little confusion, anxiety, and worry can 
prove profitable experiences along the road to understanding, but he 
accepts them as a continuing necessity. He actually becomes adjusted to a 
certain amount of maladjustment. He not only tolerates maladjustment 
but he accepts it as a necessary steppingstone to tranquillity. Such be- 
liefs form a powerful barrier to modifications in behavior and are handi- 
caps to his discovering adequate solutions to his problems. 

The Scientist’s Tolerance of Change. The scientist thrives on change. 
He is progressive because he is never too certain of his facts. He en- 
courages systematic doubt. Critical thinking results only when there is 
doubt. When there is no doubt, there can be no science. Progress is 
achieved by modifying the findings and techniques of the past and thus 
making possible a continuous adjustment to the changing conditions of 
the present. 

Instead of tolerating failure, the scientist endeavors to evolve methods 
for avoiding it. In the face of failure, he demands a change. When old 
procedures begin to lose their usefulness, when they are no longer ade- 
quate for satisfying current needs, he demands that they be modified 
or that new methods be devised. He does not admit that past methods 
will continuously measure up to the problems of the present and the 
future. 

Although the scientist welcomes change, he does not abandon the 
concepts of the past upon the first revelation of new findings. He trains 
himself to be an expert in changing his mind. He fires everything in the 
crucible of experience and accepts it only when it has demonstrated its 
serviceability. He critically evaluates all of the facts and changes only 
when the weight of the evidence demands change. He is conservative in 
that he first wants to be convinced by facts. He is progressive in that he 
will readily accept change when the facts warrant it. 


CHAPTER 2 


Presuppositions of the Scientific Method 


All of us take for granted certain characteristics of the objects and activi- 
ties we experience. For example, we take for granted the continuity of 
events in time. We live today expecting that there will be a tomorrow. 
Tomorrow we expect to encounter about the same conditions as today, 
that is, the same people, the same objects, the same kinds of relationships, 
etc. We expect these people to manifest about the same kinds of 
behavior when confronted with about the same kinds of situations. We 
expect the objects to occupy about the same points in space and to be 
serving us in about the same ways as they do today. 

We also take for granted the accuracy of the activities of knowing, 
those activities by which we come to understand ourselves and our en- 
vironment. Few of us take the time to examine the accuracy of our per- 
ceptions, memories, and reasonings concerning the world around us. For 
example, if I see a dress as red, I do not question that the dress is red. 
If I recall that I mailed a letter yesterday, then the letter was mailed 
yesterday. If I reason that today is my birthday and my wife has baked a 
cake for me on my birthday every year of our married life, then I do not 
even question my conclusion that there will be a cake awaiting me when 
I return home tonight. In the foregoing examples involving the primary 
knowing activities of perception, memory, and reasoning, full trust is 
placed in the final outcome of the psychological processes utilized. As a 
matter of fact, we are not even aware most of the time that there is a need 
for questioning the trustworthiness of these activities and so we remain 
wholly ignorant of the subtle ways in which inaccuracy in these responses 
may influence our day-to-day reactions. 

For the scientist’s scientist, ignorance about any matters that bear upon 
the correct execution of the scientific method is not to be tolerated. It 
is necessary then to consider some of these presuppositions that the 
scientist must accept as a basis for his procedures, and to learn how 
they may influence the execution of these procedures. The propositions 
that deal with the fundamental nature of the universe of objects and 
events with which the scientist works are primarily the concern of the 
philosophy of science. The propositions that directly affect the application 
of scientific procedures are the immediate concern of the individual 
scientist, both in his conduct of an investigation and in his subsequent 
theorizing about the findings. 

[...] Resemblances between material or structural elements may be associated, 
as when we associate 
blond hair with a fair, delicate skin. Resemblances between functional 
aspects may be associated, as when we associate poor ability to drive a 
car at night with a slow recovery rate of the retina of the eye following 
a bright light stimulation. Resemblances between structural elements and 
functional aspects may be associated, as when we associate a short, stocky 
physique with slowness of movement. 

Resemblances as the Basis of Scientific Classification. This process of 
discovering resemblances between natural phenomena underlies one of 
the scientist’s most important procedures, that of classification. Once 
the scientist has found some resemblances among objects, the postulate 
leads him to expect and to look for additional resemblances among these 
objects. This expectation of further resemblances leads to the postula- 
tion of resemblances that are in need of investigation. The investigation 
may demonstrate that the postulated resemblances actually exist, or it 
may demonstrate the existence of differences or dissimilarities which in 
themselves may be very important items of knowledge. 

When resemblances are sufficiently numerous to treat the objects as a 
group by themselves, the scientist forms a class and gives the group a 
class name. Some examples of common class names are igneous rock, 
radio tube, poisonous gas, cosmic ray, visual perception, personality 
trait, physiological drive, etc. The delineation of a class is based on the 
number of resemblances or the degree of resemblance, since members 
of any given class do not resemble each other in every possible respect. 

The Importance of the Postulate for Science. The postulate is impor- 
tant for science in terms of the effect it may have on the scientist’s think- 
ing. Regardless of whether or not the scientist actually verbalizes the 
postulate, he does have the expectation of finding additional resemblances 
when he is conducting a study. It is the effect that this expectation may 
have on his reasoning that is important. It may lead him to attribute 
significance to resemblances that are unimportant. Thus his reasoning 
and the subsequent inferences that he makes will be in error. Generaliza- 
tions based on unimportant resemblances usually are fallacious. 

The resemblances on which any reasoning is based should be pertinent 
to the nature of the element or aspect being inferred. Both the number 
and the importance of the resemblances determine the accuracy of the 
inference. The reader can call to mind examples like the following, in 
which the resultant inference based on analogical reasoning is in error. 
Certain primitive tribes make a wax figure of the enemy in the belief 
that as the figure melts away, the body of the enemy will waste away. 
These aborigines believe that figures that have the same shape are 
somehow physically associated. Similarly we find that in some tribes the 
warriors eat the heart of the wild game killed in order to become 
stouthearted. Zulu tribesmen chew wood to soften the heart of their enemy. 
These examples illustrate the fact that error results when the inference 
is based on superficial resemblances. 

The scientist is subject to these errors and must maintain a critical 
attitude in making his inferences. When human individuals are his concern, 
the chances of error greatly increase because the number of potential 
resemblances is legion and the intricacy of their interrelationships 
continues to defy understanding. We should be reminded that much of 
the reasoning underlying the pseudoscientific schemes referred to in the 
last chapter is based on superficial resemblances. 


THE POSTULATE OF PERMANENCE 


The Importance of the Postulate for Science. In the logic of science, 
the postulate of permanence must be accepted or science is impossible. 
Obviously, without permanence natural phenomena would show no con- 
sistency in time, and this consistency is basic to the procedures of science. 
Without permanence the aims of control and prediction would become 
meaningless. It is difficult to imagine just how the objectives of science 
could be achieved if the predictions of the future rested solely upon 
chance, which is the alternative we must accept if we deny the postulate 
of permanence. 

In accepting the postulate, science does not demand absolute perma- 
nence. Rather it is implied that natural phenomena change so slowly 
that the scientist can learn about them, and that in spite of his slow rate 
of progress what he learns remains usable long enough to be put to 
profitable application. 


THE POSTULATE OF DETERMINISM 


The Meaning of the Postulate. Science accepts the postulate that all 
phenomena are determined. All of nature’s manifestations are resultants 
of fundamentally stable processes at work and do not issue from caprice. 
“Things don’t just happen.” Each occurrence has a beginning in the events 
that have preceded it and to which, at least theoretically, it eventually 
can be traced. There is a temporal relationship between an event and its 
precursors, and this relationship supplies us with information about the 
occurrence of the event. The postulate of determinism, then, affirms that 
every event has a history in the events that have preceded it. 

Opposed to this supposition is the postulate of indeterminism or non- 
determinism, which states that events just happen. Events are not related 
temporally to other events. The idea of spontaneous generation illus- 
trates indeterminism. 

The concept of determinism is very widely accepted, most persons 
being strongly opposed to the idea that events occur by chance. We not 
only apply the postulate to our individual behavior but also consider it a 
fundamental assumption underlying all of the social institutions that 
have been established. 

Kinds of Determinism. The interpretation of the nature of determinism 
varies widely among different groups of thinkers. There are three general 
types of interpretation, as follows: spiritual determinism, self-determin- 
ism, and natural determinism. 

In spiritual determinism, forces different from those known to exist 
among natural phenomena are credited with determining behavior. They 
are referred to as supernatural and are sometimes considered to be the 
attributes of a higher Being. The phrase self-determinism is used to refer [...] 


THE CONCEPT OF IMMUTABLE LAWS 


Some of the early scientists interpreted the concept of the uniformity 
of nature as meaning that nature is governed by a set of general laws 
that have the characteristics of being both universal and changeless. It 
was sometimes maintained that once these laws were understood the 
divergent and multifarious expressions seen in natural phenomena could 
be readily explained. Underlying this conception of universal laws was a 
very strict interpretation of the immutability or changelessness of natural 
phenomena. 

There is so much evidence now available to show that natural phe- 
nomena are not constant that the concept of absolute immutability is no 
longer widely accepted. It is not essential to science that the laws gov- 
erning nature be immutable in any absolute sense. As already pointed out, 
it is only necessary that the changes be slow enough to allow the scientist 
to make profitable use of his findings. Furthermore, it can be shown that 
so-called “changeless” laws are not true in any absolute sense, but are 
only approximations of the truth. They can be considered as correct only 
within the range of approximation set by the limits of accuracy of the 
measuring instruments. In reference to their universal nature, it should 
be pointed out that, logically speaking, they are not really universal laws 
because they have not been tested over the complete range of the vari- 
ables that they are intended to encompass. They are, then, only probable 
truths. 

For our purposes the important point is that natural phenomena, in the 
main, change slowly, and consequently the laws governing their expres- 
sion need to be changed only infrequently. We need to recall that science 
allows for, expects, and encourages change of any kind. Even the laws 
issuing from previous scientific studies must come under further scrutiny 
and be changed if the evidence so indicates. 


SOME PRESUPPOSITIONS CONCERNING THE RELIABILITY OF 
THE KNOWING ACTIVITIES 


In the foregoing discussions it was pointed out that in the logic of 
science certain presuppositions about natural objects and events must 
be accepted before science becomes possible. We shall now consider 
certain general notions concerned with the knowing activities of perceiv- 
ing, remembering, and reasoning, which are the psychological processes 
used by man in gaining knowledge about the universe. These processes 
are subject to error. Because they are a bona fide part of the scientific 
method, error introduced through them will be reflected in the 
generalizations [...] 

The Problem. All of us have experienced some of the common illusions 
in the field of visual perception. These illusions clearly illustrate the 
unreliability of perceptual processes. For example, when we look down a 
railroad track into the distance, our eyes tell us that the rails come 
together. From other knowledge about railroad tracks and railroad cars we 
know the rails do not come together, and so we are able to make re[...] 
responses despite the misinformation given to us in perception. The fact 
is, however, that in the perception of the tracks the two rails do come 
together, and we can make no adjustment with our eyes that will enable 
us to see the rails otherwise. Our perceptual processes must, then, be 
considered as providing us with erroneous information. 

[...] be willing to subject our perceptual findings to whatever checks will aid 
us in maintaining the highest possible degree of accuracy. 

The Nature of Perception. In order to discriminate the true from the 
false among perceptual experiences, the scientist should understand the 
nature of perceptual processes. An act of perception involves sense-organ 
activity, attentive adjustments in the muscles and in thought, and the 
consequent arousal of meanings. 

During waking life the individual is being continuously stimulated in 
a variety of ways, hence various kinds of sensory activity are occurring 
simultaneously. In the resulting sensory experience, however, stimulating 
objects are not observed as a jumble of isolated events but as events in 
relation. Nature has provided us not only with sensory structures keyed 
to different types of energy change in the environment, i.e., eyes, ears, 
taste buds, etc., but with a nervous system through which the diverse 
sensory stimuli are brought together and integrated into various complex 
arrangements. As we introspect upon our experience at any given mo- 
ment, this experience appears to us to be unitary in nature. 

The responses of an individual are being channeled or directed to 
some degree all of the time. Even when we think a person's behavior is 
aimless, it is actually being channeled toward certain ends, but in ways 
that we either have not discovered or do not understand. Channeling of 
response involves three general kinds of sets. First, there is the direction 
of the sense organs. To obtain a clearer experience of a stimulus we 
adjust the sense organs involved; e.g., we adjust the eyes to see an object. 
This is a receptor set. Movements of the sense organs and other parts 
of the body that may be involved in perception are accomplished by 
means of muscles; therefore there are motor or muscular sets. A third 
type of set involves the channeling of thought activities, i.e., mental sets. 
For example, when we pick up a book to read, our purpose sets us to 
look for the meanings of the printed words. If upon reaching the bottom 
of a page we find we do not have the faintest notion of what we supposedly 
have just read, then the mental set has not been focused on the 
meanings of the printed symbols even though the sensory and motor 
sets may have been appropriately made. 

As we learned in the preceding chapter, facts are often composite 
experiences in which meanings obtained directly from the stimulus are 
combined with meanings rearoused from past experience. We find these 
two kinds of meanings in all perception. Meanings which originate directly 
from the functioning of a sense organ are called sensory meanings. 
Meanings which involve the rearousal of past experience are called 
inferential meanings. [...] transparent, square, or has height, we are 
verbalizing sensory meanings. When we say the ice is cold, smooth, or wet, 
we are verbalizing inferential meanings. If we now touch the ice, the 
meanings of cold, smooth, and wet are then sensory in nature. Perceptual 
experiences are always made up of both sensory and inferential meanings. 

Perception in the Work of the Scientist. It is not necessary here to 
describe how perception enters into the work of the scientist. It should 
be obvious that he constantly uses perception in all phases of his work. 
Accuracy in both sensory and inferential meanings is a prime requirement 
for his success. 

The scientist’s present perceptions are, of course, conditioned by the 
perceptual experiences of earlier occasions. We must remember that the 
inferential meanings of present experience originate in the sensory 
meanings of earlier experiences. Inaccuracy in these previous sensory 
meanings may, then, have both profound and prolonged effects upon later 
experiences. In regard to the scientist’s work, this means that the 
meanings he manipulates in connection with any given experiment are 
conditioned by the accuracy of the perceptions he has had during many 
preceding years. 

Sources of Error in Perception. Some errors of perception may be 
traced to a failure to make accurate discriminations. All perception 
involves discrimination. Perceiving an object requires that it be 
discriminated from its background and from surrounding objects. Different 
stimuli are often very similar in many of their characteristics and so may 
be reacted to with the same inferential meanings and thus confused. A 
good example of this type of error is the incorrect identification of 
identical twins. Unless a person is aware of a difference in the twins and can 
correctly perceive that difference on a given occasion, the chances are 
50 to 50 that he will commit an error in identification. 

Error may result from a failure to react to all of the direct stimulus 
contents that are significant for our purpose. This is a source of error 
particularly important for the scientist. It refers to the very common 
tendency of an individual to direct his attention to such a narrow field that 
much significant stimulation fails to be appreciated and thus fails to 
influence him. Many meanings are possible in any perceptual situation 
because many details are present to the senses and there are many 
relevant meanings in the perceiver’s past experiences. There is interaction 
between the different stimulating aspects of any given situation, and 
sometimes one aspect may dominate and prevent response to some other 
aspect. Some of the factors that condition this failure to respond to all 
pertinent aspects of a situation are strong goals, preconceived mental 
sets, intellectual biases, emotional prejudices, distractions of the moment, 
etc. For example, distractions of the moment in large measure dominate 
the stimulating situation created by a ventriloquist. Paying attention to 
only the actions of Charlie McCarthy and not allowing certain other 
sensory stimuli to enter the perceptual activity produce a very real 
illusion. 

Error in perception may result from a failure to identify the source 
of the meaning. Sometimes we mistake an inferential for a sensory mean- 
ing. Inferential meanings, of course, have many of the characteristics 
possessed by meanings gained directly from sensory stimulation. They 
involve color, shape, sound, warmth, movement, and the like. It is not 
difficult to see, then, why at times the two types of meaning are confused. 
If the inferential meanings dominate the perceptual experience, then the 
individual may not adequately react to the sensory stimuli present and 
thus may lose the correcting influences that these stimuli might otherwise 
contribute. In hallucination the inferential meanings are dominant in 
controlling the perceptual situation. They are so intense that they are 
interpreted as sensory meanings by the individual. Thus to the hallucinated 
alcoholic the spiders, pink elephants, or snakes he sees on the wall 
appear real and are not interpreted as merely figures on the wallpaper. 

Means for Preventing Errors in Perception. The scientist should train 
himself in specific perceptual skills. Perception is not a general process 
but many specific processes. It involves many specific sensory, attentive, 
and thought reactions. Training in a certain area cannot be expected to 
spread over and affect perceptual responses in all other areas. Training 
should be directed to specific types of problems and involve the develop- 
ment of specific anticipatory sets. For example, if the scientist’s problem 
is going to involve the use of the microscope, then he had better spend 
some time learning the many little perceptual tricks involved in operating 
this particular aid to vision. If the individual is going to work in the field 
of labor-management relations, he had better train himself in the specific 
perceptual situations of this field. 

Training in background meanings is important. When one is searching 
for something new, as is the scientist, his success is directly predicated 
upon his being able to bring to bear many past experiences through 
the manipulation of inferential meanings. The new has to come from 
recombinations of the old. It is, then, very important that there be a rich 
and full store of relevant information. The more information the scientist 
has concerning the general area containing his problem, the greater are 
his chances of hitting upon a recombination of the old that will solve the 
problem. 

Errors in meaning or failure to get all the significant meanings in a 
situation may result from the functioning of emotional and intellectual 
biases. Emotional biases are based on the personal feelings of the 
individual and are very likely to distort his perceptual activities. Whenever 
an individual supports his beliefs in the face of evidence to 
the contrary, an emotional bias is to be suspected. Such a [...] 
selective agent in perception. Consequently, the person will tend 
to develop and retain those meanings that agree with his bias and [...] 
meanings in disagreement with the bias despite any truth that [...]. 

The scientist should train himself against harboring [...]. 
He usually is not a person who becomes emotionally involved in the 
defense of his findings, but at times he may take his theories too [...] 
and allow them to blind him to the possibilities of alternative [...]. 
This is an example of intellectual bias. Darwin is reported to have 
purposely formed an intellectual bias against his theory of [...] in 
order that he would not fail to include any and all facts that were 
contrary to his views. 

The scientist can take several simple steps to reduce the [...] 
influence of emotional and intellectual biases. He can thoroughly acquaint 
himself with all the points of view concerning his problem and [...] 
particularly well those views which are in opposition to his own. He can 
purposely examine his own thinking for bias and encourage his [...] 
to be free with their criticism. He can purposely restrain himself from 
“overworking” the theory that he is championing, and refrain from 
overgeneralization even when the theory is richly supported by empirical 
findings. 


THE POSTULATE OF THE RELIABILITY OF REMEMBERING 


The Problem. We are all familiar with the tricks that memory is
forever playing on us. The failure of the busy husband to remember the
groceries correctly, bringing home beans when peas were wanted; the
failure of the student to remember the right material to study for a test,
spending most of his time studying irrelevant content; the failure of the
physician to remember correctly the symptoms of the familiar disease
diphtheria, diagnosing it as eczema; these are examples of a frailty to
which the scientist also is heir.


Here again we have a problem. How can we ever determine when the recall
is in error? To do so we must refer to the original event, and memory is
involved in the identification of the original event.

The scientist must admit that he is liable to make mistakes in recalling
past events, but in spite of this limitation he must postulate the
fundamental reliability of memory. He would make little progress if he
seriously questioned the accuracy of recall of every event. Knowing that 
memory may be in error, however, he can devise aids by which to increase 
the accuracy of remembering and thus minimize possible errors. 

Remembering Dependent upon Learning and Retention. Remembering 
is part of a larger response. When an individual remembers that the 
diagonal of a square is equal to the square root of twice the square of one 
side, two other conditions must previously have been met. First, learning 
must have occurred—the rules about square figures, powers, roots, hy- 
potenuses, and the like, were acquired. This process of learning the rules 
involved changes in the physiological mechanisms of the body, prin- 
cipally in the nervous system and particularly in the brain. Secondly, the 
neurological changes must have persisted over a period of time in such 
a form that when appropriate stimuli were subsequently presented the 
mechanisms were rearoused to activity. This second phase, the temporal 
persistence of the neurological changes, is called retention. Remembering 
may, then, be adversely affected by factors that adversely affect either 
the original learning or the retention. Whether or not some item of the 
past will be remembered, to what extent it will be remembered, and in 
what form it will be remembered are, therefore, dependent upon a 
variety of factors. 
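
The rule recalled in this example follows from the Pythagorean theorem. Written out as a short worked equation, with s standing for the side and d for the diagonal (symbols introduced here only for illustration):

```latex
\[
d^{2} = s^{2} + s^{2} = 2s^{2}
\quad\Longrightarrow\quad
d = \sqrt{2s^{2}} = s\sqrt{2}
\]
```

Recalling the final form presupposes that each step (squares, sums, roots) was learned and retained.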

Remembering in the Scientist’s Work. As with the activity of perception,
remembering enters into all phases of the scientific method. The
scientist is continuously manipulating the meanings of the past. Throughout
an investigation, whether it is an experiment, a simple interview, or a
complex survey, he is recalling information to apply to his current needs. 
Although the scientist has constructed many successful mnemonic aids, 
his work is still greatly dependent upon the effectiveness of his memory. 

Recall of the ideas of others and of his own from previous occasions is 
especially important in the beginning of a study, when he is trying to 
devise new concepts. The new always comes as a reconstruction of the 
old, and therefore it is important that he have at his finger tips a large 
background of information. During the conduct of the study, memory
may prove vital, even to “making or breaking” all previous
accomplishment. For example, during a crucial run in an experiment the
apparatus may start to falter and just a small item of technical informa- 
tion may be all that is necessary to keep it functioning. Without this bit 
of information the crucial run might fail, resulting in the loss of all of the 
experimental data up to that point. Again, in the final interpretative phases 
of a study, when the scientist is extracting all the meanings he can from 
the data, his work is greatly facilitated if he can readily recall the
background meanings that are relevant to his findings.

Errors of Remembering. Defects of remembering may be classified
under two headings: failure to remember, which refers primarily
to the quantitative aspects of recall; and change in fidelity of the recalled
response, which refers to qualitative differences between the remembered
response and the originally learned response. Psychologically, this is an
artificial separation. Actually, the same processes that underlie what is
recalled also underlie the degree to which it is recalled. Some errors can,
then, be classified under both headings.

Failure to Remember. Failure to remember is in part due to poor initial 
learning. The more efficient the original learning, the higher the chances 
will be for effective recall. 

Failure to remember may be due to the fact that the patterns in the 
nervous system, basic to the response, have not been kept intact over 
the period of disuse when no recall was attempted. Competing responses, 
learned later, may act to prevent the desired recall, that is, some more 
recently learned item may inhibit the recall of the desired item. We have 
all had the experience of trying to recall someone's name and of failing
because of the persistent remembrance of another name that we know is
incorrect. Later learned responses, of course, may also facilitate the
recall of an earlier learned response.

Failure to recall may be due to a lack of sufficient stimulation at the
time recall is attempted. Learning of an event does not occur in isolation;
it is initiated and enhanced by a large number of associated stimulating
conditions. The more adequately these associated stimulating conditions
are represented in the recall situation, the higher the chances that the
recall will be successful. The associated stimuli will vary in kind, in
number, and in intensity, and therefore the recall will be a function of the
pertinence, number, and intensity of the associated stimuli that are
reproduced.

One further reason for failure to remember may be offered, and that
is that remembering is selective in nature. Not all aspects of a response
are remembered on a given occasion because not all of them are congruous
with the purposes and motives of the individual at the time.
The “tall tales” heard around the campfire
after a day of fishing contain elements that are meant primarily to
impress rather than to adhere to the truth. Here memory is
selective because of the purposes of the teller, and as a consequence the
tale does not represent all aspects of the experiences being
described.

Selectivity of remembering is also traceable in part to the selectivity
present in the original learning, where not all aspects of the experience
may have been equally well learned.

Errors in Fidelity of Recall. These errors are of several kinds and stem
from different sources. The most characteristic feature of these errors is a 
change in the meanings involved. This change seems to arise from the 
influence of strong mental sets and purposes on the part of the individual. 
The remembered event is made to harmonize with some other thought or 
feeling. There is ample evidence that the interests, attitudes, and expecta- 
tions of the individual do determine the nature of the meanings elicited 
in later recall. The error usually is one of distortion, the recalled mean- 
ings being different from those established at the time of the original 
learning. 

In his effort to confirm a theory, a scientist, in recalling the relevant 
findings of other scientists, may recall clearly those facts that support 
his own notions but may hazily and inaccurately recall those facts that 
are contrary. Thus a false interpretation is placed on the works of other 
scientists, which may be carried forward to influence his work right up to 
the completion of the project. The scientist faces a situation similar to 
that of a witness in a courtroom. It is clear that when a witness is before 
the court describing a former experience, say his observation of an acci- 
dent, he is not describing the accident, nor is he describing his perception 
of the accident—he is describing what he can remember of his percep- 
tion of the accident. Similarly, when a scientist refers to the findings of 
another investigator from memory, he does not reproduce the written 
descriptions of the experiment as furnished by this investigator, nor does 
he reproduce exactly his actual perceptions of the investigator's report. 
He gives a description of those impressions of the other investigator's 
report that his own selective perception and his own selective and limited 
memory make available. Thus there is considerable opportunity for
distortion.

Obviating the Errors of Remembering. To reduce or to eliminate errors 
in remembering, sound psychological principles must be followed during 
the learning, during the intervening interval, and during the recall
situation. Some of these principles are expressed in the following rules: (1)
get an accurate and comprehensive perception of the event to be learned;
(2) learn all of the aspects of the event that are judged to be important
and necessary; (3) learn with the intent to recall and later utilize; (4)
overlearn; (5) make a record of those parts that are particularly difficult;
(6) review; and (7) reproduce at the time of recall the stimuli that were
present in the original and review situations.

The first two principles assist in getting thorough coverage in the origi- 
nal learning and in putting the various aspects in their proper perspec- 
tive. Selective learning can then be so controlled that bias and distortion 
do not enter the learning. Adherence to these two rules will aid in
getting higher fidelity in later recall. In the third step, the event is
translated into the purposes, needs, and expectations of the scientist. As a
result, the event has more significance for the scientist, who thus benefits
from the fact that it becomes an important element in his motivational
schemes. This enhances recall.

The fourth rule refers to the need of learning well whatever is learned,
that is, not stopping the learning process at the point when an immediate
recall is deemed satisfactory. The event must be overlearned if recall
following long delays is to be satisfactory. Overlearning clarifies and
intensifies the meanings of the event and presumably intensifies and makes
more stable the neurological changes occurring during the learning.
This increases the chances that future relevant stimuli will prove
adequate for reexciting the underlying nervous mechanisms. The fifth rule
deals with the strengthening of those aspects which are particularly
difficult and which, if not given this special attention, might result in
complete forgetting of the whole event. The sixth rule demands a brushing
up or relearning of the event. This step serves, as no other step does,
to preserve intact the changes in the neurological mechanisms which
underlie the event and which make recall possible.

The seventh rule is concerned with the stimuli present at the time recall
is attempted. As already stated, the success of recall is a function of the
current stimuli, inasmuch as they serve to elicit the rearousal of the past
experience. At the anticipated time of recall, the scientist should
introduce all the aids he can muster in the form of sensory and ideational
stimulation. These aids include written notes, records of data,
photographs of apparatus, and similar helps that will contribute to eliciting
the desired response.

A final statement should be made concerning the use of memory aids.
All of the many types of recording gadgets employed by the scientist are
aids to memory. Polygraphs, photographs, recording meters, recording
clocks, calibrated containers, written records of subjects' responses such
as test booklets, written accounts logging the progress of the study, diaries,
interview records, and other kinds of records of observations aid in
reconstructing with high fidelity the particular situation being subjected
to analysis. Some of these aids get their value from the fact that they
result in an immediate record of the occurrences (e.g., sound recording
of conversations in an interview), which in most instances is superior to
the descriptions obtainable at a later time by means of recall. Many of
these aids provide a more accurate picture of the particular aspect they
encompass than the scientist can provide from memory, e.g., the
photograph of a microscopic slide depicting the reproduction of bacteria.
Certainly, the scientist should avail himself of any aid that will increase the
fidelity of his remembrances.


THE POSTULATE OF THE RELIABILITY OF REASONING 


The Problem. Thinking and reasoning are frequently used as synony- 
mous terms. Of the two, thinking is the more inclusive, reasoning being 
one form of thinking. Other general forms of thinking have been given 
the names of recollecting and imagining. Our primary concern is with 
the trustworthiness of reasoning. 

The problem is reduced to the question: Is the reasoning of the scientist 
trustworthy? As the reader must suspect, the answer is similar to those 
given for perception and memory. Reasoning is not completely trust- 
worthy, being subject to many kinds of error and sometimes resulting in 
false products even in the hands of the expert. Despite its limitations, 
however, we cannot accept the skeptic’s view that reasoning cannot be 
trusted. His point is untenable because it is through reasoning that he 
arrives at his conclusion that all reasoning is untrustworthy. We must 
accept reasoning as fundamentally trustworthy but be willing to subject 
it to any checks that will assist in preventing inaccuracy. 

The Nature of Reasoning. Reasoning is a form of problem solving in 
which the manipulation of ideas is substituted for the manipulation of real 
objects, as, for example, in overt trial-and-error behavior. It is the process 
of attacking a problem by means of concepts rather than by means of 
overt responses. Reasoning involves the use of many psychological proc- 
esses, including symbolizing, remembering, imagining, comparing, ana- 
lyzing, synthesizing, abstracting, inferring, generalizing, etc. 

If the individual is going to respond to an object when it is absent as a 
sensory stimulus, or if he is going to manipulate thought objects not even 
having perceptual counterparts, he must have some mechanism for repre- 
senting these objects to himself. The mechanism that has proved most 
effective for most individuals is language. Objects and relations are repre- 
sented in thought by means of words. The word becomes a sign of the 
meaning that the object has for the individual and serves as an ade- 
quate surrogate. 

Reasoning is made very effective through language because of the 
apparently limitless possibilities for meanings of the latter. The objective 
of all reasoning is to re-form past meanings into new combinations. The 
result is new meanings. Language offers, through its alphabetical symbols, 
almost limitless possibilities for representing the new meanings as they 
are developed. 

Through concepts we can manipulate objects in ways that are impos- 
sible in terms of actual objective movements. For example, we can think 
of pushing over the Golden Gate Bridge. We can think of an object out 
of its concrete context, as when we think about dogs without referring 
to specific dogs, in specific locations, on specific days, belonging to specific
people. Again, we can change and elaborate meanings of objects beyond
those known in perception. One of our highest achievements in thinking
is to form meanings that are so far removed from our perceptual
experience that it is impossible to find any perceptual counterparts for them.
Such ideas of this higher order are represented by the concepts of an
endless progression, n-dimensional space, infinite time, and the like.

The concepts in thinking may issue directly from percepts, as when we 
note the relationship between parts of a mechanical puzzle; or stem from 
recollections, as when we remember the destruction caused by the big 
flood last spring; or involve imagination, as when we think about creating 
a device for automatically registering man’s thoughts.

Reasoning in the Scientific Method. As is true for the activities of per- 
ceiving and remembering, there can be no science without reasoning. 
In the beginning, reasoning aids in delimiting and framing a problem. 
Through it the irrelevant elements are eliminated and the relevant factors 
are organized in the most meaningful arrangements. After the problem is
stated, reasoning is necessary and essential in framing the solution. Here
begins the effortful task of discovering, applying, fitting, modifying, and
finally of rejecting or accepting potential solutions.

Once a solution is tentatively accepted there follows a similar complex
and varied attack in devising methods and techniques for obtaining data
that will test the validity of the potential solution. During the conduct
of a study, many problems arise for which there are no ready-made
solutions. The scientist faces problems about the order of experimental
conditions, of apparatus design, of types of control, of statistical
analysis, and the like.

In the stage of generalization, when the scientist has the data before
him and is contemplating their significance, he again appeals to
reasoning. The significance of the data for the solution, the significance of the
solution for the problem with which he began his study, and the
significance of his findings for other related problems are worked out through
the use of reasoning.

Sources of Error in Reasoning. Reasoning is beset with many kinds of
errors. Those that are infractions of the formalized rules of logic are
treated systematically in books on logical thinking. Some of these errors 
are the concern of the scientist, and he should become familiar with 
them. Our concern here is to point out some of the psychological sources 
of error. The line of demarcation between the psychology and the logic 
of errors of reasoning will be found to be rather indistinct. 

Errors in reasoning may be due to bias. An intense desire to confirm 
a hypothesis may result in the scientist’s placing an incorrect interpreta- 
tion on some item of information pertinent to his problem. This mistake 
may then be projected into all of his subsequent reasoning. Bias may
result in errors of selection, errors of interpretation, and errors of
inference, all of which are important in scientific reasoning.

Errors in reasoning may be due to inaccurate judgments pertaining to 
the appropriateness and use of statistical and experimental techniques. A 
striking example of an error in experimental procedure was brought to the 
attention of one of the authors during the Second World War. In a 
government-sponsored project conducted by a reputable biochemist, 
large groups of individuals were exposed to the administration of certain 
vitamins. There was a small change observed in their visual efficiency
subsequent to the use of the drugs, a result that was predicted and desired
by the investigator. Whether the change was significant or not could be
determined only by comparing the behavior of the tested subjects with
that of a control group. For some unknown reason, in the design of the
experiment no provision had been made for a control group; consequently,
the findings had no general value. This was indeed a serious
error in judgment.
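
The role of the control group in this example can be sketched in a few lines of Python. This is only an illustration: the scores are invented and are not data from the project described, and the function names are hypothetical.

```python
from statistics import mean


def mean_change(before, after):
    """Average per-subject change from the first to the second measurement."""
    return mean(a - b for b, a in zip(before, after))


def change_relative_to_control(treated_before, treated_after,
                               control_before, control_after):
    """Change in the treated group beyond the change seen in the control group."""
    return (mean_change(treated_before, treated_after)
            - mean_change(control_before, control_after))


# Invented visual-efficiency scores, for illustration only.
treated_before = [10, 12, 11]
treated_after = [12, 14, 13]
control_before = [10, 11, 12]
control_after = [12, 13, 14]

treated_gain = mean_change(treated_before, treated_after)
net_gain = change_relative_to_control(treated_before, treated_after,
                                      control_before, control_after)
print(treated_gain, net_gain)
```

In this invented case the treated subjects improve, but the untreated subjects improve by the same amount, so nothing can be attributed to the treatment. Without the control scores only the first figure would be available. A real analysis would also require a test of statistical significance, which this sketch omits.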

Errors in reasoning may result from assigning incorrect meanings to 
words or from allowing the same word to carry more than one meaning 
during a series of logical deductions. The following familiar example 
from logic illustrates change in word meaning: 


Light is given off by the sun, 
Feathers are light, 
Therefore, feathers are given off by the sun. 


Although the error is very obvious in this example, at times word-meaning 
changes occur that are not readily discernible and escape detection by 
the scientist. 

Suggestions for Avoiding Errors in Reasoning. Certain rather immedi- 
ate checks can be used by the scientist in detecting and removing errors 
in his reasoning. One is the formal check of logic. Much of the scientist’s 
reasoning can be subjected to the formalized pattern of syllogistic reason- 
ing. Having arranged his facts and arguments in the appropriate form, 
he can subject them to the rules of logic and thus learn if they fulfill the 
demands of correct reasoning. 

Several other checks should be made routinely. The scientist should 
examine the nature of the assumptions that underlie his problem and note 
what bearing they have on the specific outcome that he anticipates from 
his study. He should carefully examine the relevance to his problem of 
the evidence already available. He should make certain that he has in- 
cluded all of the known facts in his own statement of the problem. Before 
being satisfied with his own interpretation of his findings, he should
explore other possible hypotheses. Each of these checks will reduce his
chances of making errors in his generalizations.

Errors due to bias are reduced by developing appropriate attitudes. For
example, the scientist should exercise an impersonal point of view. His
acceptance of any evidence should not depend upon its agreeing with
his hypothesis. Again, the scientist should be tolerant of change.
Stereotyped thinking is not scientific thinking, and it will not occur when the
scientist develops an attitude of looking for and accepting change.

Errors of judgment are directly dependent upon the scientist's under- 
standing in the area in which he is working. They are a function of his 
insightfulness and are reduced as his knowledge is increased. In all areas 
where his experience or knowledge is restricted or limited, the chances 
of error in judgment are increased. There is no substitute for wisdom in 
the area of the problem under study.


CHAPTER 3 


The General Aims and Methods of Science 


From our consideration of various characteristics of science, we gained 
some knowledge about its aims and methods. A more thorough treatment 
of these aspects is now in order. 

The objective of the scientist is to understand the phenomenon with 
which he is working. He considers that he understands it when he can 
successfully predict its expressions under circumstances somewhat differ- 
ent from those used in studying it, or when his knowledge enables him to 
control its expressions to achieve certain ends. 


FOUR FUNDAMENTAL QUESTIONS 


The scientist asks himself four questions. In connection with any phe- 
nomenon he may query: Is it so? He is here concerned with the existence 
of the phenomenon, that is, whether or not what he has experienced 
has any degree of permanence. He wants to know if he can experience 
it repeatedly and if other observers can also experience it, or if the phe- 
nomenon is an illusion, a fantasy, or a delusion. A second and closely re- 
lated question is: To what extent is it so? This requires an estimate of 
the magnitude, amount, frequency, or some other quantitative character- 
istic of the phenomenon. These two questions fall primarily in the province 
of description. 

Having satisfied himself on these questions, the scientist then asks: 
Why is it so? He is now required to do some “speculating,” to reason be- 
yond the facts that he has collected; to get behind the facts, so to speak. 
Related to this third question is a further one, viz., What are the condi- 
tions that bring about the phenomenon? In certain respects, the answer- 
ing of this question provides some of the necessary information for 
answering the third question; that is, to determine the conditions that 
bring about a phenomenon is to take a first step toward understanding 
why it is so. These last two questions fall primarily in the realm of
explanation and theory.


UNDERSTANDING AS THE GENERAL AIM OF SCIENCE 


Understanding and the Search for Truth. One of the most general
statements of the aim of science is: to discover truth about natural events.
To do this requires knowledge about the events, and this knowledge
comes from the experiencing of the events. Experiencing natural
phenomena gives the scientist his facts, and his aim is to discover,
accumulate, and interpret facts and relationships among facts.

Facts about the natural universe are not isolated events but are
patterned and related in diverse ways that in most instances are unknown
to the scientist. Sometimes facts are initially experienced in ways that
are meaningful, but most of the time the scientist's effort is expended in
arranging the known facts in new patterns in an effort to discover
unknown meanings and relationships. Through rational analysis he organizes
the facts into more and more abstract and general systems. The end result
is the formulation of general principles or laws under which all of the
facts and relationships within some restricted domain of experience can
be subsumed. The word understanding best expresses the end result of
this “search for truth,” and it expresses more accurately than any other
word the general aim of all scientific work.

A Continuum of Understanding. Experience, knowledge, and
understanding are closely related. They should be placed on a common
continuum with experience at the beginning and understanding at the end.
From experience we pass through knowledge on our way to attaining
understanding. There are no sharp lines of demarcation between them.
They are really three different points or levels on a common axis.

It is obvious that understanding is more than experiencing. Sometimes,
having experienced a phenomenon on several occasions, an individual will
declare that he understands it. He may be in error, and this type of error
frequently occurs in everyday life. Mere repetition of the experience of
an event does not necessarily result in an understanding of the event. We
all can recall having experienced some phenomenon many times, and of
knowing little more about it after the last experience than we knew after
the first one. Most housewives do not understand electricity although they
have used it in many ways over many years. Experience is a first step,
and an important step, toward understanding, but it is not
understanding.


Understanding is more than knowledge. Knowledge is a second step
toward understanding. By manipulation of experience through the
thought processes old meanings are reinterpreted and new meanings are
discovered. The end result is knowledge. This knowledge is then further
enlarged, organized, and systematized, and the end result is
understanding. Knowledge, then, must be integrated and ordered before we have
understanding.

The Continuous Expansion of Understanding. Understanding is char- 
acterized as continuously growing and expanding. At the beginning it 
waits upon experience and knowledge, but after it comes into being in its 
own right it does not remain static. Additional experiences and further 
knowledge increase understanding, so it is continuously evolving. Gaining 
understanding is a never-ending process, because with each increment 
of understanding further doubt arises, and this doubt, in turn, creates a 
need for more experience and knowledge. In some individuals, under- 
standing begets complacency; in others, uncertainty. If we were forced to 
classify the scientist, we would, of course, place him among those whose 
understanding causes them continually to question the status quo and 
thus to seek further experience and knowledge. 


PREDICTION AS AN AIM OF SCIENCE 


The Meaning of Prediction. No scientist is content to stop after he has 
made a discovery, confirmed a hypothesis, or explained a complex phe- 
nomenon. He wants to make some use of his results. He therefore projects 
his generalizations to situations in which he believes they will hold; he 
makes predictions concerning the way the principles he develops will 
operate in new situations. 

Suppose that in studying the intelligence of a class of eighth-grade 
children we learn that the brightest child obtains a score twice the 
amount of the lowest score achieved. Such a large discrepancy might 
lead us to conclude that the progress in school attainable by the children 
getting the highest and lowest scores would differ considerably if
differences in intelligence were reflected in school achievement. We might
predict that the brightest child could progress faster if given more work,
or that the social adjustment of the dullest child would improve if he were
not forced to compete with the brightest child. Thus on the basis of this
present knowledge we are forecasting what would take place if we used
this knowledge in a given specific way.

Prediction and Understanding. Prediction is based on understanding. 
Understanding forms the springboard from which prediction into the 
unknown is made. In turn, prediction contributes toward the further
testing and verification of understanding. One check we can apply to our
understanding of a phenomenon is the success with which we can use 
that understanding in new situations. If our prediction is unsuccessful, 
then our understanding of the phenomenon is to be questioned and chal- 
lenged in reference to the particular predicted situation. 

In the aforementioned example, our predictions concerning the
improvement in the rate of progress of the brightest child or the
improvement in social adjustment of the dullest child might turn out to be
successful, thus verifying the application of our findings. We might then
recommend the segregation of students in school in terms of their
intelligence-test scores. We could make a broad prediction that the school
progress and social adjustment of all children would be improved if they
were allowed to work with children of their own intelligence level. This
prediction might then be found in error. The social adjustment of
children might be more closely associated with their ages than with their
intelligence-test scores. Equalizing the children in terms of intelligence
would exaggerate differences in the ages of the children occupying the
same classroom. If there were a very close relation between age and
social adjustment, we would have to revise our notion about segregating
all of the children in terms of their intelligence. Regardless of whether
our prediction turned out to be correct or incorrect, the result of the
prediction would have a direct effect upon our understanding of the
problem involved.

The Nature of a Prediction. The predicted situation always
differs from the predictor situation. Sometimes the difference is small,
as when a given situation is being duplicated with a minimum of change.
For example, having determined the learning scores of rats in an [...]
maze, we might predict a similar distribution of scores for these rats in
an elevated maze. Sometimes the difference is large, as when many
factors are allowed to vary between the predictor and predicted
situations. Having determined the distribution of scores of some rats on a
brightness-discrimination problem, we might [...] of the visual areas of
the brain and predict [...] function would be lost. In this instance [...]
differences in health and differences in learning ability, and
consequently our prediction has more likelihood of failure than if [...]
are present force us to accept the prediction as only tentative. [...]
will, of course, vary [...] the less [...] prediction will turn out
successfully, [...]
The test may be either rational or empirical in nature. In a rational
test we may show, through reasoning, that the outcome of our particular 
explanation or theory ought to be of a certain kind or have particular 
characteristics. If our prediction concerns relationships that are only 
partially understood, we may be able, through reasoning, to increase 
this understanding by introducing into the relationships additional mean- 
ings that at first were thought to be irrelevant. 

Suppose we are interested in explaining how we see color. We recall 
that there are two kinds of retinal structures, the rods and the cones, and 
that the latter are color-sensitive. We also know from our color-mixing 
experiments that the spectral colors can be obtained from variations in 
the mixture of three colors, viz., a certain red, a certain yellow, and a cer- 
tain blue. Associating these facts together might lead us to the notion 
that there are three color-sensitive structures in the retina, one for each 
of these colors. We now can explain how we see color by stating that the 
light rays differentially activate one or more of these three color-sensitive 
structures in the retina. With this explanation we can now proceed to
predict what might happen if there were radical changes in these struc- 
tures. We might, for instance, predict that if a person were born without 
any one of them he would be color-blind to certain hues. Or again, if the 
color-sensitive structures were not evenly and uniformly distributed in 
the retina, a person might see certain hues and fail to see certain other 
hues in some given part of the visual field. We are pleased with our ex- 
planation because it enables us to give plausible predictions about other 
events in which we are interested. 

Tests of the rational kind are one of the most valuable tools of the 
theoretical scientist. Although he may not be interested in knowing if his 
ideas have practical value in everyday-life situations, he nevertheless is 
concerned with any forward reference that his data, explanations, or
theories may have. Consequently, he finds prediction a valuable aid in 
forecasting in “conceptual space” what can be expected from his ideas. 

In an empirical test, the prediction is applied to conditions in the 
natural world. The relationships stipulated in the findings are projected 
to natural phenomena and these phenomena are then carefully observed 
to determine if these relationships occur according to the demands of the 
prediction. 

Let us consider again the example of the predictive value of our ex- 
planation of how we see color. After having predicted that if people 
were born without one of these sensitive color processes in their retina
they would be blind to certain particular colors, we could then explore 
the possibility that there are color-blind persons who fit our predicted 
descriptions. We would first describe (predict) the types of color blind- 
ness that would occur, depending on which color process or combinations 
of color processes were absent in the retina. For example, if the retina
contained no red-sensitive process the person should not see red, and
if all three sensitive processes were absent the person should be totally
color-blind. Having calculated the various types of blindness, we could
then study color-blind people and learn if their actual blindness
corresponded to the blindness predicted by our color-vision explanation. If
we found a relatively high correspondence, we would have empirical
evidence supporting our explanation.
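
The step of describing in advance the types of color blindness that should occur can be sketched in a few lines of Python. This is only an illustration under simplifying assumptions. The three process names follow the red, yellow, and blue of our example, but the rule that a hue is lost exactly when its process is absent, and the function names `predicted_blindness`, `all_predictions`, and `correspondence`, are assumptions made for the sketch and are not part of the explanation itself.

```python
from itertools import combinations

# The three postulated color-sensitive processes named in the example.
PROCESSES = ("red", "yellow", "blue")


def predicted_blindness(absent):
    """Return the hues a person should fail to see.

    Assumption for illustration: a hue is lost exactly when its
    process is absent from the retina.
    """
    return {hue for hue in PROCESSES if hue in absent}


def all_predictions():
    """Enumerate every combination of absent processes with its prediction."""
    predictions = {}
    for size in range(1, len(PROCESSES) + 1):
        for absent in combinations(PROCESSES, size):
            predictions[absent] = predicted_blindness(absent)
    return predictions


def correspondence(predicted, observed):
    """Fraction of observed cases that match one of the predicted types."""
    if not observed:
        return 0.0
    predicted_types = {frozenset(p) for p in predicted.values()}
    matches = sum(1 for case in observed if frozenset(case) in predicted_types)
    return matches / len(observed)


if __name__ == "__main__":
    for absent, unseen in all_predictions().items():
        print("absent:", absent, "-> should not see:", sorted(unseen))
```

A high value from `correspondence`, computed on the blindness actually observed in color-blind persons, would count as empirical support for the explanation; a low value would send us back to revise it.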

Eventually, all predictions should be brought to some kind of empirical
test. Empirical conditions offer the most easily understood types of
situation. They usually allow for the coincident observation of the events
by many individuals, and therefore increase the probability of reaching 
agreement among different investigators. 


CONTROL AS AN AIM OF SCIENCE 


The Meaning of Control. As an aim of science, control refers to the
manipulation of the conditions determining a phenomenon in order to
achieve some desired end. In utilizing present understanding to control
the functioning of any factor, we are thus testing and verifying this
understanding.

Ready examples of the use of control are to be found in the area of
vocational guidance. Aptitude-test scores have been found to correlate
rather highly with success in college. From this finding we can
exercise a more intelligent control over the admission of students to
college training. We can advise an individual who has very low
scores that he should not attempt college work. In such an instance we
might save the person from many serious frustrations and direct his
activities to areas where he would achieve marked success. We would be
exercising control over his behavior, and the resulting achievement would
serve to verify the understanding gained concerning the relation between
aptitude-test scores and college success.

Control and Prediction. Control is a corollary aim to prediction.
Generally, the two are inseparable when interpreted as general aims of
science. To achieve any prediction, regardless of how simple it might be,
some control of the determinants of the behavior is required. In the problem
of predicting college success from aptitude-test scores, we have exercised
control by permitting only certain kinds of behavior to be expressed, i.e.,
those behaviors that are elicited by the particular tests used. Likewise,
regardless of the behavior we desire to control, there is always an
uncertainty about the end result, so whether we consciously make a prediction
or not the conditions characteristic of prediction are present. When we
speak of the relationship between aptitude and college success in
connection with the advice we give a high school graduate who is considering
going to college, there is implied a predictive relation between the two
types of variables.

Control and Application. Sometimes the term control is restricted to 
situations of a practical nature, such as the example of segregating school 
children in terms of their intelligence-test scores. This is a narrow inter- 
pretation of the term. Control serves equally well at the abstract and 
theoretical levels on which the “pure” scientist works. His task is to form 
inferences from his theory and to devise new conceptual situations to 
which the theory can be applied. He must logically show how an end 
result of a given kind can be produced by controlling the conceptual 
situations according to the implications of his theory. For example, the 
concept of control can be found throughout Einstein’s theory of relativity, 
although what Einstein did can in no sense be construed to be control
of practical situations.


THE EMPIRICAL AND RATIONAL PHASES OF SCIENCE


The scientist capitalizes on any type of approach that he thinks will 
enhance his chances of gaining knowledge. Some of these involve the 
direct manipulation of the natural phenomena he is studying, whereas 
some of them involve the use of the higher mental processes by which he 
thinks about these phenomena. There are then both empirical and ra- 
tional phases, and the scientific method is an intelligent combination of 
these two types of procedures. 

The Empirical Phases of Science. The meaning of empirical should 
not be restricted to the meaning of “that which is sensed.” Experiences 
of natural phenomena are the first facts the scientist collects. The original 
experiences he gains in collecting the facts are of great significance be- 
cause they are the “stock” with which he “sets up housekeeping.” 

In addition to the sensory experiences themselves, empirical refers to
the techniques and procedures in which sensory experience plays an
important role. Beyond the original observations, empirical features are to
be found in all of the subsequent steps in which sensory experience is
present. For example, the experimentalist, in the designing, constructing,
and operating of apparatus, depends heavily upon empirical procedures.
This is also true for various types of analysis of the data. Frequently the
scientist reduces his data to graphic form and through the examination
of diagrams, figures, and drawings he discovers many new meanings.
These are empirical procedures.

Empirical facts are the point of origin of evidence. The scientist tests 
his ideas under natural conditions. These tests thus provide the means for
confirming and justifying all of his work. They are the court of final 
appeal where he must be content to rest his case. 

In the realm of the production of hunches, ideas, and hypotheses,
empirical procedures by themselves are strictly limited in scope. Alone, raw
sensory experience provides us with only a rather elemental type of mean- 
ing. Mere awareness of an object is very limited. We add little to knowl- 
edge if we terminate our activity at this point. The full import and sig- 
nificance of an experience results from various intellectual manipula- 
tions of the sensed data gained in the experience. Rational phases, then, 
are essential to higher-order meanings. 

The Rational Phases of Science. Whether or not the facts of experience 
eventually add significantly to our knowledge depends upon the kind of 
rational manipulations we perform and the accuracy with which we
perform these manipulations. We can study the data statistically or logically;
we can analyze them into more elemental structures or combine them into
complex patterns; we can note their similarities or their differences. These
rational manipulations of sensory and inferential meanings are the
“heart” processes of our descriptions, explanations, generalizations, and
theories. Through these manipulations we learn about the relationships
between other variables and the phenomena under study and the
significance that these relationships have for future understanding.

The rational phases of the scientific method include all procedures in
which higher-order meanings are involved and include such individual
processes as memory, abstraction, inference, reasoning, generalization,
judgment, and the like.

Certainly, as scientists, we must use rational procedures from the be- 
ginning of an investigation right through to the end. Science particularly 
requires rational processes in the setting up of the problem, in the analysis 
of the results, and in the interpretation of the findings. 

The rational procedures of formal logic are a valuable part of the
scientific method. As indicated earlier, scientists must practice “straight
or sound” reasoning. Logic sets up patterns of reasoning for us to follow,
patterns which, if followed, will lead us to what are considered
logically correct conclusions. Logic aids in the accurate formulation of
propositions so they can be evaluated against other possible alternatives.
It enables us to state our postulates so their full implications can
be developed through further reasoning. “Straight” thinking is required
in scientific inquiry, and therefore we should not depreciate a procedure
that has as its fundamental purpose the description of the conditions
for accurate thinking.


THE MAJOR METHODS OF SCIENCE


In an earlier discussion it was pointed out that science can be inter- 
preted as a very general method composed of many important but less 
general procedures. Some of these procedures deserve separate treatment 
because they form the solid core of the scientific method. These major 
methods are symbolization, description, explanation, and theorizing. 


SYMBOLIZATION AS A METHOD OF SCIENCE 


The Meaning of Symbolizing. Symbolization has to do with translating 
experience into symbols. Experience is fleeting; it is here, then gone. 
There is little time for pondering its nature while its sensory compo- 
nents are still manifest. If we are to deal with an experience after its 
disappearance, some change caused by the experience must be carried 
over in time and must be of such a nature that it can be rearoused in
memory and manipulated by means of various thought processes. This 
is accomplished through symbols or words. Experiences are given names. 
Every event, and every characteristic of every event, is given a symbol 
or word-tag by which it is known from that time forward. It is this word- 
tag that, so to speak, makes the experience “immortal.” 

Symbolization includes the assignment of names to objectively ob- 
served events, e.g., tables, rocks, houses; the assignment of names to sub- 
jective events, e.g., joys, pains, thoughts; and the assignment of words 
and other symbols to conceptual events created through the thought 
processes, e.g., infinity, purposes, theories, relativity.

Language is stressed in symbolization because it is the most widely 
used type of sign. It is not sufficient for all purposes, however, and an 
investigator may need to adopt or invent some other forms of symboliza-
tion. Two other systems of signs that are of great service are mathematics
and symbolic logic. 

The Characteristic of Correspondence in Symbolizing. The most im-
portant characteristic of symbolization concerns the degree of accuracy
with which the symbols represent the facts they stand for. The objec-
tive in science is to develop symbols which can accurately substitute
for the particular aspects of the events we want to represent; that is, to
devise symbols that will faithfully signify the meanings of these events.
It is important, then, that there be some form of correspondence between
the world of events, on the one hand, and the system of symbols, on the 
other. An ideal arrangement is one in which there is a distinct symbol 
for each of the attributes, phases, qualities, aspects, elements, etc., that 
can be found in a given class of events. Actually we must be content with
much less than this perfect “co-relationship.”

Symbolizing in Science. We find that symbolization contributes to the
scientist’s work in two very significant ways; namely, (1) it makes
possible a permanent record of experience, and (2) it furnishes a vehicle
or mechanism for the rational manipulation of past experience.

By means of symbols, experiences are retained in varying degrees by
the individual, so that through his memory he can later reproduce them
for further examination and use. Experiences also are retained through
written records. Both of these methods enable the scientist to call forth
again the meanings of a previous experience. He thus gains an unlimited
number of opportunities to study the experience.

It will be recalled that the manipulation of symbols in thought is
one of the essential procedures in science. The scientist, by
manipulating the word that stands for an event, is doing the next best
thing to manipulating the event itself. In fact, dealing with an event
in thought by manipulating its word meanings is in some respects
superior to manipulating the event as an experience. The actual experienced
event occupies a single precise point in time and space; the recalled event
does not, and can therefore be manipulated indefinitely without respect
to temporal and spatial contexts.

The Demands to Be Met by Symbolization. It will help us to
understand what we should expect from symbolization as a method in science
if we briefly review three characteristics of natural phenomena that set
the demands that must be met by a successful system of symbols.

The tremendous complexity of natural phenomena sets the most [...]
demand. There is no apparent limitation to the extent to which natural
phenomena can be subdivided and differentiated. Each event studied,
upon closer examination, is found to be composed of parts, and,
similarly, each part is found to be composed of other parts, ad infinitum.
Truly, on the face of it, we are here confronted with what we can call an
infinite progression. To be successful, our system of symbolization must
have unlimited possibilities in regard to the number of signs it can
supply.

A second demand to be met by our system of symbols is referable to
the characteristic of change. Never is the “same” event exactly the same.
It is merely treated as constant in order to fulfill some particular purpose.
Our system of symbols must be flexible enough to accommodate itself
to changes in meanings occurring in time.

A third characteristic is summed up in the word relationship. Not
only is an object found to be divisible into parts, and the object and its
parts found to undergo continuous change, but the object and its
subdivisions are found to be related in very complex ways with each other
and with other objects and their subdivisions. For the want of exact
knowledge and a more precise symbol, we shall use the word infinite to
characterize both the number and diversity of the relationships that exist
among natural phenomena. We must make our symbols accurately
express the nature and degree of these relationships.

The Deficiency of Words as Symbols of Natural Phenomena. Keeping 
in mind the foregoing demands placed upon symbolization, let us consider 
the handicap under which we are working in trying to force natural phe- 
nomena into the system of signs called language—a system that we have 
poorly mastered even at best. We need not question the fact that lan- 
guage is one of the most valuable tools that man has invented, but there 
is need for questioning man’s failure to make a more accurate use of 
the language he possesses. In meeting the demand for increased num- 
bers of meanings, we have not exploited language to the fullest. We 
have been content to allow the same word to do double duty—actually, 
many times more than double duty. We also have been negligent in an- 
other way; we have assigned the same meaning to more than one word. 
Certainly, when we are in need of accurate representation of such a
tremendous number of meanings, it is most inefficient to use the same
word to stand for several meanings and to make several words function
as representatives of the same meaning.
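
The two kinds of negligence just described, one word carrying several meanings and one meaning carried by several words, can be checked mechanically once usage is written down as pairs. The following Python sketch is offered only as an illustration. The small vocabulary and the function names `overloaded_words` and `redundant_meanings` are hypothetical and invented for the example.

```python
from collections import defaultdict

# Hypothetical usage records: each pair is (word, meaning).
# The entries are invented for illustration only.
usage = [
    ("drive", "an inner urge"),
    ("drive", "to operate a vehicle"),
    ("urge", "an inner urge"),
    ("maze", "a network of paths"),
]


def overloaded_words(pairs):
    """Words that are made to stand for more than one meaning."""
    meanings = defaultdict(set)
    for word, meaning in pairs:
        meanings[word].add(meaning)
    return {w: m for w, m in meanings.items() if len(m) > 1}


def redundant_meanings(pairs):
    """Meanings that are represented by more than one word."""
    words = defaultdict(set)
    for word, meaning in pairs:
        words[meaning].add(word)
    return {m: w for m, w in words.items() if len(w) > 1}


if __name__ == "__main__":
    print("double duty:", overloaded_words(usage))
    print("same meaning, several words:", redundant_meanings(usage))
```

A system of symbols in perfect correspondence with its meanings would leave both results empty.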

Language is deficient in representing changes in time. In many in- 
stances natural phenomena change faster than the words with which we 
describe them. Much of our language is still in the “horse-and-buggy” 
era. In fact, we persist in refusing to accept the innovations that crowd 
in upon us, and even ridicule the individual who dares to “coin” a new
word.

Language, in the strict meaning of the term, is most deficient in
regard to the symbolization of relationships. Here there is a severe
limitation on the number of words that are available, so the scientist has
had to look elsewhere to find a more exact and comprehensive system.
This he has found in the symbolization of mathematics. Seldom does a
scientist nowadays rely solely upon words for representing the
relationships he wishes to describe. Numbers and their relationships are now a
prime necessity in science.

Science has demonstrated through its use of mathematics and tabular
and pictorial techniques that meanings can be represented with
precision. It likewise has improved the precision of its descriptions through a
more careful use of language as a vehicle for meanings. Some of this
increased precision has come from the invention of new words, but much
has been achieved through correct choice and use of familiar words.
This is a step we all can take to achieve greater accuracy in our
language. We are not expected to invent a new term every time we
encounter difficulty in expression. It can be expected, however, that we
shall exercise increasingly greater care in the selection of the words
we use.


DESCRIPTION AS A METHOD OF SCIENCE 


[...] natural phenomena [...] as isolated events serve no useful
purpose [...] attempt to symbolize the obvious rela[...]

Sensory experiences and involv[...] experiences, [...]

Description serves a bookkeeping function. At the time of the occurrence
[...] is never time for us to analyze it into its [...]
The classification assigned a given event or process is to be considered
as somewhat tentative in nature. Classificatory schemes gradually un- 
dergo change as more and more knowledge is gained about the events 
or processes being classified. What appears on first inspection to be a 
certain kind of property upon a more thorough examination may turn 
out to be quite a different kind of property. 

Description as Seriating. A second form of description is called simple
ordering or seriating. Seriating requires more knowledge about the
events than does classification. It requires not only some common
characteristic or feature in all of the events but that this characteristic or
feature be known to exist in degrees or amounts, or be arrangeable on
some form of continuum in a consistent way. If we are studying the
reading ability of individuals then we can arrange our subjects in an order
according to their speed of reading. The subjects are not only classified
as being capable of reading, but are arranged in order on a continuum
of reading speed. Another example, in which the ordering is of a little
different kind, is the arrangement of geometric figures constructed from
straight lines. We can arrange the following figures on a continuum in
terms of the number of their sides: triangle, quadrangle, pentagon,
hexagon, heptagon, and octagon. When the characteristic being ordered is
a magnitude and is measurable, a basis is available for accurately
determining differences between the objects or events and thus a more precise
classificatory scheme can be developed.
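
Seriating amounts to sorting events along a single continuum. The Python sketch below illustrates this with the two examples just given. The subjects' names and reading speeds are hypothetical values invented for the illustration, and `seriate` is simply a name chosen here.

```python
# Hypothetical reading speeds in words per minute; the names and
# numbers are invented for illustration only.
reading_speed = {"Ann": 310, "Ben": 190, "Cal": 250, "Dee": 275}

# Geometric figures and their number of sides, as in the example above.
figure_sides = {
    "hexagon": 6,
    "triangle": 3,
    "octagon": 8,
    "quadrangle": 4,
    "heptagon": 7,
    "pentagon": 5,
}


def seriate(scores):
    """Arrange items in order along a continuum, lowest value first."""
    return sorted(scores, key=scores.get)


if __name__ == "__main__":
    print(seriate(reading_speed))
    print(seriate(figure_sides))
```

Because reading speed is a measurable magnitude, the same table also yields the size of the difference between any two subjects, which the bare ordering does not.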

Description as Correlating. A third application of description is called
correlation. In examining a group of objects it is sometimes found that
two different characteristics are associated in such a way that when one
is present the other is present and when one is absent the other is
absent. The two characteristics are said to occur together, and such
a relationship is referred to as a correlation. An example is the
variation of eye color and hair color in man. In
individuals with blond hair, the eyes are more frequently light in color
than dark, while the reverse of this tends to be true for persons
possessing brown or black hair.

Quantitative characteristics may also be found correlated, as in the
relationship between the variables of height and weight. Such a
correlation can be expressed in numerical terms, and thus its variation in
degree or amount can be stated quantitatively.

It will be noted that, in general, correlation
goes somewhat beyond classification and seriating; that is, the facts are
first classified and arranged in order of magnitude before an attempt is
made to correlate them.

In the forms of description discussed above, it should be remembered
that the feature or characteristic used as a basis for classifying, seriating,
or correlating must be discoverable in the facts or events themselves.
What actually can be observed or what can be empirically demonstrated 
becomes the basis on which the description proceeds. No recourse is 
made to knowledge that lies beyond the events or to inferences or 
theories that transcend the knowledge gained directly from the events. 
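
The statement that a correlation between two quantitative characteristics, such as height and weight, can be expressed in numerical terms may be illustrated with a short Python sketch. The product-moment coefficient used here is one common index of correlation and is chosen for the illustration; the text does not name a particular index. The heights and weights are hypothetical values invented for the example.

```python
from math import sqrt


def pearson_r(xs, ys):
    """Product-moment correlation between two equal-length lists."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two lists of equal length, at least 2 items")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of cross-products and sums of squares of the deviations.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined when a variable is constant")
    return cov / sqrt(var_x * var_y)


# Hypothetical heights (inches) and weights (pounds), invented for illustration.
heights = [62, 65, 67, 70, 72]
weights = [120, 135, 150, 165, 180]

if __name__ == "__main__":
    print(round(pearson_r(heights, weights), 3))
```

The coefficient runs from -1 through 0 to +1. Everything that enters the computation is taken from the observed measurements themselves, which keeps the procedure at the level of description.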


EXPLANATION AS A METHOD OF SCIENCE 


The Meaning of Explanation. As already stated, one of the
fundamental objectives of science is to find the reasons for the occurrence of
events. The scarcity of facts may compel us to resort to higher-order
conceptual meanings in order to account for the phenomena we are
studying. In searching for the possible conditions giving rise to an event,
we are trying to answer the question: Why is it so? Explanation is the
fundamental method through which we discover the answer to this type
of question.

Explanation proceeds to the discovery of higher-order meanings by
means of the manipulation of concepts. Symbolization, then, is necessary
for explanation. The sensed experiences and the meanings derived from
them must be symbolically represented in verbal or other form and thus
made available for mental manipulation.

Explanation involves abstraction. Conceptual meanings depend upon
the process of abstraction. As we attempt to create new patterns and
relationships among the facts, reasoning takes us further and further
away from the factual meanings of description to meanings at higher
and higher levels of abstraction and generalization. Meanings in the form
of postulated entities, processes, or relations, which the scientist
conceptually invents to account for his results, are called logical constructs.

At higher levels of abstraction, explanation becomes theorizing. When
an explanation effects a pattern of logical constructs as a conceptual
framework into which all the facts relevant to some phenomenon can
be fitted, it is usually called a theory.

Let us again refer to the example of explaining how we see color. The
three most important empirical meanings are that we see variations in
color, that the retina of the eye is not stru[...] at least two structures,
namely, the rods and the cones, and [...] are the sensitive structures
responding [...] seen. Explanation now enters in the postulation that
[...] structures, each sensitive to [...] wave lengths. These [...]
structure in the retina and therefore are [...] one step removed, through
abstraction, [...] cones. They are [...] about the responses [...] knowledge
of rods and cones. In attempting to learn about the various kinds of color
blindness, we manipulate these conceptualized retinal structures. They
are logical constructs that we use to push beyond the empirical facts
that we now possess about color seeing.

Explanation and Description. There is general agreement that there is
no sharp dividing line between description and explanation. Explanation 
begins where description leaves off. Both have the fundamental function 
of discovering the meanings of experienced events through the manipu- 
lation of symbols. The primary feature that distinguishes the two is the 
relative amount of conceptualizing involved. As already pointed out, 
the purpose of description is to discover the meanings that are observ- 
able in the sensed data themselves. The manipulation of the data is done 
in ways that issue directly from an observation of the facts available to 
experience. In explanation, the meanings are less observable in the data 
and are discovered through some process of mental manipulation of the 
data. The meanings derived at the descriptive level are further manipu- 
lated in explanation in an attempt to discover additional meanings. In 
our example of color vision, the empirical meanings about the functions 
of rods and cones were manipulated and gave rise to the postulation 
of three different retinal structures sensitive to color stimuli. The mean- 
ings of these three conceptual structures were then further manipulated 
to discover additional meanings about how we see colors. 

Compared to the meanings of description, the meanings of explanation 
are more flexible, that is, they can be more easily changed to suit the 
purposes of the investigator. As a consequence, the meanings of
explanation are more controversial. Procedures of mental manipulation are
private to each thinker, and it is often difficult to get these manipulatory
processes sufficiently similar in two or more individuals to achieve
correspondence in the final meanings devised. In the experiencing of
concepts, individuals do not see eye to eye as readily as in the experiencing
of percepts.

Referring again to our example, the exact colors which presumably
result from the activation of the three postulated retinal structures
cannot be established to everyone's satisfaction. Different investigators
postulate different structures because they use different criteria for defining
them. One investigator may define the retinal structures in terms of the
three colors which, through mixture, give only the spectral colors.
Another may use as his criterion the three colors which, when mixed, give
both spectral and nonspectral colors. In these two instances the three
postulated color structures would not be exactly the same.

The meanings of explanation are subject to less control than the
meanings of description. Being removed by several steps from the empirical
facts, explanatory meanings also are further removed from the controlling
influence of experience. This is apparent in the explanations of the
chemical changes in the cone and rod responses. We know that in rod
stimulation there is a bleaching of a substance called rhodopsin. In cone
response we are not certain about the existence of a similar substance and 
therefore our explanations of cone response are more variable. Broad 
explanations of behavior, like the instinct hypothesis, are difficult to dis- 
lodge from the layman’s thinking because they are far removed from 
the actual concrete facts known about behavior.

Compared to descriptive meanings, explanations are more tentative in 
nature. In general, an explanation contains so much meaning that is 
guessed that it must be accepted only as a possible truth. As it receives 
verification through logic and experience it can be expressed as a probable 
truth, and sometimes the degree of probability can be accurately stated, 
depending on the amount and accuracy of the empirical data available. 

The Purposes Served by Explanation. Explanation is directed toward 
increasing our understanding of natural phenomena. It is like descrip- 
tion in that it results in the formation of classificatory schemes into which 
sensed data may be meaningfully organized, but the schemes of explana- 
tion are not readily observable in the data and depend primarily upon 
the reasoning processes. Through manipulation of their conceptual 
meanings, explanation relates variables in terms of their less obvious 
features. It results in the discovery of the more subtle orders that char- 
acterize the relations among natural phenomena. 

Explanation enables us to carry knowledge forward. Explanation re- 
veals the gaps existing in our understanding and sets about to devise 
the necessary conditions that will bridge these gaps. Explanations built 
on past experiences make easier the understanding of present and future 
experiences. Knowledge from the past has to be put on trial. Through 
postulation, this knowledge is modified and formed into explanation, 
which then is subjected to empirical testing. Knowledge is then carried
forward in time through explanation and is thus used in the gaining of
further knowledge.


THEORIZING AS A METHOD OF SCIENCE 


In his attempt to understand nature, man has never been content with 
merely gathering and ordering existential facts. He seems always to have 
a burning curiosity to discover some supposed “final explanation.” An 
examination of the explanations that he has conjured up to account for 
his behavior will show that he has run the full gamut of the explanatory 
continuum, from the factual at one end to the highly imaginary at the 
other. What he has lacked in fact he has readily made up with fiction. 
The suppositions used by older generations seem a bit incredible to us 
today, but in our own ways we “moderns” continue to call upon little
understood postulated entities or principles—pixies—to fill in the gaps of 
knowledge, thus giving the appearance that our intellectual armor is 
impervious. 

Older Fallacious Theories of Behavior. One of the earlier explanations 
of the behavior of the feebleminded and insane is illustrative of the pre- 
scientific theories man has held. The early diagnosticians believed that 
a mentally deficient person was possessed of an evil spirit or of a good 
spirit, according to the nature of his behavior. If the diagnosis was of an 
evil spirit, all manner of exorcisms, magic rituals, physical punishments, 
and the like were practiced in an effort to banish the supposed demon. 
More fortunate was the person who was judged possessed of a saintly 
spirit, as he was considered a messenger of the Deity, and his every want 
was administered to by those who curried his favor.

Another example of this fallacious explanation of abnormal behavior 
is seen in the early New England conceptions of witchcraft. Hysterical 
symptoms in the form of anesthesias were declared to be of the devil and 
merited the harshest of treatment. Many persons displaying such symptoms
were put to death by hanging.

Fairy tales are rich in the use of personified concepts as determiners
of events. Giants, dwarfs, brownies, elves, goblins, and similar imaginary
persons, who have all of the traits of human beings, are conjured up as
explanatory devices to allay the questioning fears of children or to free
the parent of the task of giving valid explanations to the thousand-and-one
questions of his offspring.

Modern Fallacious Theories of Behavior. Modern man has not freed ...
that has never been adequately explained. It can readily be dismissed
from the minds of some thinkers, however, by being attributed to human 
nature. Instinct is another tired, overworked old pixy. It seems that every 
type of behavior of which man is capable has at one time or another been 
attributed to instinct. Heredity and environment are always available as 
explanatory devices and are frequently called in to settle disputes about 
the determination of behavior. The unconscious is another modern pixy 
of questionable repute, seemingly charged with about every function 
that the human individual possesses. 

We must recognize that these latter concepts are in use today, some
of them playing a prominent role in current explanations of behavior. 
Thus the censor mechanism of psychoanalysis has become the “little 
man” who guards the portals to the dungeons of the unconscious. This 
pixy is most mortal in his behavior; and his frailties, so humanlike, explain 
in turn the behavior of the individual in which he dwells. To character- 
ize concepts such as these as modern pixies is to bring into relief their 
uncritical use as end explanations of human response. It is to level criti- 
cism against those users of concepts who, when they announce that a 
given concept is applicable to some behavior, consider that thereby they 
have fully accounted for that behavior. We do not suggest that their 
concepts are entirely useless to psychology. Rather, we wish to indicate 
the need for clearer thinking with regard to the manner in which their 
concepts are formulated, interpreted, and used. 

Scientific Pixies. A scientist devises theories to understand better that 
which he observes. He does not engage in theorizing merely to satisfy 
some intellectual curiosity. By the use of logical reasoning he deduces 
and formulates from present knowledge postulates through which common
features and relationships or underlying principles and laws can be
discovered, thereby rendering more understandable the phenomena he
is investigating. He devises a rule that states the common conditions of
a group of events, and then under this rule he subsumes the new event
to be explained. This step of bringing forward knowledge in the form
of hypotheses to be verified is an essential step in his program to discover
truth. He makes progress because the insufficiency of old explanations
stimulates him to evolve new hypotheses. These, in turn, lead to new
modes of experiment and analysis and thus to the discovery of additional
knowledge.


The thalamic theory of emotion is a good example of this procedure.
The familiar association between ... explained. Furthermore, ...
meant the simple fact that the individual can develop means for preventing
impulsive, explosive, and intense emotional expressions, needed
to be explained. In the thalamic theory the physiological changes are 
considered a bona fide part of the emotional experience. The thalamic 
region of the brain was postulated as a controlling center from which 
issued the nervous impulses that evoked the emotion. Impulses from 
the thalamus excite the sympathetic nervous system and through it elicit 
widespread bodily changes. The cortex of the brain was postulated as 
containing the higher centers through which the thalamic region was 
kept under control. The cortex of the brain is involved in the reasoning 
and evaluative processes by which emotion-provoking situations are 
assessed. Through these processes the thalamus is brought under con- 
trol. Following the enunciation of this theory a large number of experi- 
ments were conducted that have greatly increased our knowledge con- 
cerning the nature of emotional responses. 

Scientific Compared with Nonscientific Pixies. It is interesting to note 
that the pixies of the scientist and the nonscientist are alike in one point; 
namely, they are born of the imagination—they are beyond apprehension 
by the senses. By the process of assumption, these imaginary factors are 
assigned explanatory powers. The scientist postulates entities, structures,
relations, mechanisms, and the like, which he cannot sense, that is, see,
hear, or feel. He employs constructs that do not refer to things that are
actually observable or that are representative duplications of former
sensory experiences. These constructs refer to entities or relationships
the existence of which he postulates in order to understand better the
things that he actually observes. In a similar way, through his imagination,
the nonscientific individual postulates his ghosts, spirits, elves,
and gremlins.

More important for our consideration are the points on which the
pixies of the scientist differ from those of the nonscientist. To begin
with, scientific theories are not personified. They are not imaginary
people, big or little, good or evil. They do not have the traits
of people. They do not have desires, feelings, or intentions. They do not
have to be placated as do the gods of primitive tribes.

Theories of science are not reified concepts. They do not come alive
and act, working toward ends and accomplishing purposes. For example,
in the hands of some psychoanalysts the concept of the unconscious has
been reified. In their thinking it is no longer a simple idea that
helps to explain man’s behavior. It is described as if it were a person
on the inside of the individual, a person with desires and ambitions
at variance with those of the individual. Such an interpretation is not
far removed from ghosts and demons.

Scientific theories stem from facts. Although a theory involves entities
or constructs that are not observable, the propositions through which the
theory was devised stem from facts. The scientist defines his theories 
very carefully, assigning them the characteristics that they have in order 
that they may explain the observed events. He is aware that they are 
products of his imagination which he projects into reality. The non- 
scientist is usually unaware of the linkages through which his pixies have 
evolved from empirical situations. 

To the scientist, a theory is a tool of research. It is not an end in itself 
but a means to further understanding, a form of lever by means of which 
he can pry loose more facts. A theory to him is something to be tested. 
It provides various postulates, and from these postulates the scientist is 
able to devise theorems for empirical testing. If he is unable to devise
testable theorems the theory is abandoned as unproductive. The pixies 
of the nonscientist are accepted uncritically; they are accepted without 
challenge. He does not feel the need of questioning them and sees no 
need of subjecting them to any analysis or test. They do not serve as use- 
ful tools because they do not lend themselves to investigation. 

A scientific theory has a predictive character through which the scientist 
seeks to improve his control over new phenomena. The pixies of the non- 
scientist offer hindsight, not foresight. They are conjured up to account 
for past events and are used by the witch doctor and his modern coun- 
terparts as portents of the future. They are unpredictable. For example, 
the censor of the Freudian psychoanalyst has its own whims to satisfy
and its “behavior” cannot be foreseen. Being unpredictable, such pixies
offer no control over future events.

The scientist controls his theory ... and purposes and makes it work for him. ...

Marx, M. H.: The General Nature of Theory Construction, in M. H. Marx
(ed.), “Psychological Theory: Contemporary Readings,” chap. 1, The 
Macmillan Company, 1951. The basic assumptions and elements of theory 
construction are discussed as these are found in the science of psychology. 
Both reductive and constructive types of explanations are considered. 


CHAPTER 4 


Functional Organization among Natural Phenomena 


A knowledge of the functional relationships among variables is basic to 
man’s understanding of nature. Particularly with the scientist does this 
concept of functional organization play a central role in the attack upon 
“unknowns.” His task is to find and describe order in the world of natural 
phenomena. The consistent and stable functional relationships that he 
finds among events he calls laws of nature. These laws are descriptions 
of the sequences of events that he has found to recur regularly. Knowledge
of these relationships enables him to predict the occurrences of new
instances of such relationships and thus gives him a hold upon the future.

The utilization of functional relationships by both the layman and the
scientist is predicated upon the postulate of the uniformity of nature.
Our knowledge of the relational meanings of events gives us assurance
that phenomena will continue to happen in an orderly fashion. Thus
the concept of functional organization, generalized to all phenomena, is
basic to man’s everyday activity, and it is particularly important in the work
of the scientist.


SOME FUNDAMENTAL RELATIONSHIPS 


When we examine experience, we find many kinds and many degrees
of relationships. We find relationships in space, including such meanings
as adjacency, inside, outside, above, below, under, over, left, right, and
other similar meanings. Relationships involving time include
antecedent, consequent, sequentiality, ... relationships underlie the meanings
... attributes, such as quality, intensity, size, complexity. In the realm of
thinking, concepts are related in a tremendous number of ways. Each
of the following words refers to a relationship: logical, dependent, connect,
union, equal, affiliated, ratio, comparable. There is a plethora of
patterns, schemata, linkages, concatenations, transitions, etc., observable
among event processes whether these are sensory experiences or rational
experiences.

The terms event process and event sequence are used in order to
place emphasis upon the dynamic or process characteristic of experi- 
ence. The argument is that experiencing is a function, a process, an 
activity; and therefore what we call our experiences are not static cross 
sections of unrelated events but are ongoing events in dynamic relation. 
The word event can be substituted for these terms as long as an event 
is looked upon as an activity. 

At any one instant we are aware of only a limited number of the rela- 
tionships of a given event process. Purpose and interest dictate our 
attending to specific aspects or contexts and our disregarding those rela- 
tionships of the event process that are not involved in our particular 
problem. 

Temporal and Spatial Organization of Perceptions. Probably the most 
dominant organizational dimension we experience is that of time. All 
of our perceptions of objects are temporally related in some way or another.
Several event processes may be perceived simultaneously or as
one following another in a sequence. The experiences of the past, when 
represented in the present through memory, make possible a comparison 
of past and present. Furthermore, the perceived sequences noted in the 
present arouse expectations of their continuance or of their recurrence. 
Thus we extend the time meaning forward and experience the idea of 
the future. The meanings of time and of duration are then readily 
induced from our experiences of event relationships. These common
psychological experiences of past, present, and future can be accepted 
as facts regardless of the particular way in which we might wish to 
define time. 

Event processes are also experienced as related in space. Almost as 
soon as we are able to respond to an object we experience that object 
as occupying space and being in relation with other objects in both its 
immediate and remote environs. 

These characteristics of time and space are two of the most important 
relationships with which we as scientists must deal. In order to accom- 
plish any given end we must arrange in relation in both time and space 
those events that we think determine the end event we are pursuing. 
We must deal with event processes that are both prior to and current 
with the desired event process, and that are both juxtaposed to and 
spatially remote from this event process.

Organization among Concepts. Both layman and scientist inquire and 
reflect about the characteristics of experience and devise relationships 
among conceptual categories in order to make nature more intelligible. 
Concepts arise from the meanings of previous perceptions. By a process 
of abstraction we develop meanings that are not tied to the concrete 
reference points in space and time that bind our perceptions to the here
and now. Conceptual meanings are then freed from the perceptual
events that gave them birth, and through reasoning they are combined 
and recombined in all the diverse relational ways that human imagina- 
tion can conceive. 

A homely illustration will help to make clear this process of concep- 
tualization. Let us consider the concept of “dog.” In perception we 
experience many dogs, all of which are in some way tied to concrete 
temporal and spatial environments, and each of which has definite char- 
acteristics of shape, size, color, speed, noise, usefulness, and other traits. 
In thought, however, we can respond with meanings that are not referred 
to a given dog occupying a specific space at a given time and with a 
given set of characteristics. We can talk about just large dogs, or just 
short-haired dogs, or just those dogs we saw last week at the dog show, 
or just the dogs that we intend sometime to buy for hunting. In various
ways we can abstract “dog meanings” from the specific perceptual
relationships within which they were first experienced. Having abstracted
these meanings, we can then proceed to recombine them in many new
and different relationships.


Man has conceptualized a tremendous number of meanings, and he has 
arranged each of these meanings in a vast number of relationships. It is 
difficult to comprehend the number and complexity of the organizations 
among concepts that are now available to him. Suffice it to state that the 
activity of organizing conceptual experiences is a continuing one, and 
that one of the primary contributions of science is the enlargement of the 
organizational structure of man’s concepts about nature. 

The Causal Type of Relationship. One of the most significant relation- 
ships developed and used by man is the cause-and-effect sequence. It 
occupies a predominant position among organizing principles because of 
the important role it plays in practical experience. For most people it is 
basic to explanation, a description of the causes of an event being 
accepted as a valid and complete explanation. 


INTERPRETATIONS OF THE CONCEPT OF CAUSALITY


In the following pages, some of the more widely used meanings of 
causality are presented. It is important that we understand how these 
meanings are supported by evidence. Certainly, the scientist should 
champion any meaning confirmed by empirical findings. These are the 
meanings that will be useful in his attempt to discover and describe 
further orderly relationships among natural phenomena. The three major 
interpretations of the causal sequence are the animistic, the mechanistic, 
and the correlational. 

The Animistic Interpretation of Cause. The animistic interpretation is 
expressed in the statement “All things are full of gods.” The ancients 
ascribed to their gods the responsibility for the changes they observed 
in the world around them. The gods were the creators and the producers 
of events. They behaved as agents. Acting as the cause, the agent pro- 
duced the effect. In later time agents took on other forms, such as elves, 
goblins, spirits, demons, werewolves, and the like. 

This primitive animistic interpretation of cause has almost disappeared 
from scientific writings. To some extent it is still found in the literature 
of psychiatry and psychoanalysis, two disciplines concerned with the 
treatment and cure of abnormal forms of behavior. Following is a representative
description from psychoanalytic literature.¹ It purports to explain
the causes for the mental ill-health of a young lady.


To make ourselves more explicit, it will be necessary to say something about 
the elements of the psychic apparatus. According to Freud’s formulation the 
child brings into the world an unorganized chaotic mentality called the id, the 
sole aim of which is the gratification of all needs, the alleviation of hunger, 
self-preservation, and love, the preservation of the species. However, as the 
child grows older, that part of the id which comes in contact with the environ- 
ment through the senses learns to know the inexorable reality of the outer world 
and becomes modified into what Freud calls the ego. This ego, possessing 
awareness of the environment, henceforth strives to curb the lawless id tend- 
encies whenever they attempt to assert themselves incompatibly. The neurosis, 
as we see it here, was, therefore, a conflict between the ego and the id. The 
ego, aware of the forces of civilization, religion and ethics, refused to allow 
motor discharge to the powerful sexual impulses emanating from the lawless 
id, and thus blocked them from attainment of the object towards which they 
aimed. The ego then defended itself against these impulses by repressing 
them. The young lady in question seemingly forgot this whole episode. Had 
the repression continued unabated, she would have remained healthy. But the 
repressed material struggled against this fate, finally broke through (as a
substitutive formation on paths over which the ego had no control), and obtruded
itself on the ego as symptoms. As a result of this process, the ego found
itself more or less impoverished, its integrity was threatened and hurt, and
hence it continued to combat the symptom in the same way as it had defended
itself against the original id impulses.

¹ A. A. Brill, “The Basic Writings of Sigmund Freud,” p. 12, The Modern Library,
Random House, Inc., 1938.


It is obvious that in this description the “id” and the “ego” do not 
remain as concepts descriptive of empirical events. They are mystical entities
performing as personalities, clothed with awareness, wishes, strivings,
repressions, and the like. Reified in this manner, they do not greatly
differ from the animistic concepts of primitive people. 

The Mechanistic Concept of Cause. As the name implies, this inter- 
pretation reduces the cause-and-effect sequence to a mechanical system. 
Objects and events are tied together in a mechanical relationship wherein 
forces are transmitted from the beginning, the cause, to the end, the 
effect. Knowing the objects, relations, and forces, it is possible to recon- 
stitute the antecedent conditions and thus produce the same end results. 
For example, if several bricks are set on end in a row with the distance 
between adjacent bricks being less than a brick’s length, then if one of the 
end bricks is toppled over toward the others, the entire row
of bricks will be made to fall.

It is not difficult to understand why force became ...
of the mechanistic interpretation of causality. ...
work, where the individual is required to exert force in lifting, pushing,
pulling, jerking, and the like, he identifies force with his own feelings
of exertion. Seeing similar work performed by machines, he transfers
the concept of force to the machines concerned. Thus the cause of the
acceleration of one body is ascribed to the force transmitted to it from 
another body. In the study of levers, gears, pulleys, and the like, we are 
concerned with the amount of force that can be transmitted from one 
point to another. Here there is a physical medium extending from the 
cause to the effect. In the field of light, where no visible medium is 
found, the ether is postulated as the means by which the energy change 
can be transmitted from the sun to the earth. This postulate resulted 
from the mechanical notion that there must be some intervening medium 
through which the sun’s influence, as a force, can be transmitted. 

Seventeenth- and eighteenth-century physical science championed 
mechanistic causality. The relationships among events were interpreted 
in terms of mechanical models. Accordingly, transmission of an impersonal
physical force is a necessary condition for a cause-and-effect relation
between two events. The mechanistic concept also includes the
meaning of production or generation. The cause produces or generates
the effect.

For some scientists, then, nature’s ways are mechanical and are to be
explained in mechanistic terms. Causality is interpreted mechanistically. 

The Correlational Concept of Cause. The Meaning of the Concept. 
This interpretation of the causal concept was developed by scientists of 
the nineteenth and twentieth centuries. Science developed as an empirical 
study of nature, and the scientist directed his attention to characteristics 
that were observable. He devised various concepts by which he could 
classify these characteristics in terms of their similarities, coexistences, 
and sequential relationships. He sought the causes for a given event 
process in the associated occurrences of other event processes. The con- 
cept of causality was then applied to the concomitant variations observed 
to occur among event processes. It became identified with correlation. 

True to his empirical approach, the scientist refers all of his meanings 
to the experienced phenomena from which they are derived. This he did 
with the meanings of causality. He considered it futile to speak of causal- 
ity at a general level without being able to refer its meanings to empirical 
events. Correlational relations among variables were the only observable
relationships he could find for justifying causal meanings. 
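
The idea of concomitant variation can be made concrete with a small computation. The sketch below is not taken from the text: it assumes the product-moment (Pearson) coefficient as the measure of correlation, and the function name and the sample observations are hypothetical.

```python
import math


def pearson_r(xs, ys):
    """Return the Pearson product-moment correlation of two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length with at least 2 items")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    # Sum of cross-products of the deviations from each mean.
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    # Sums of squared deviations for each variable.
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined when a variable does not vary")
    return sxy / math.sqrt(sxx * syy)


# Hypothetical observations: hours of practice and errors made on a task.
practice_hours = [1, 2, 3, 4, 5]
errors = [9, 7, 6, 4, 2]
r = pearson_r(practice_hours, errors)
```

A coefficient near -1 or +1 describes a close concomitant variation between the two event processes. In keeping with the correlational view, it says nothing about one variable producing or forcing the other.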

Correlations without a Temporal Sequence. The nature of a phe- 
nomenon is exhibited in, and consists of, the correlational relations that 
this phenomenon has with other phenomena. This viewpoint includes 
relations as sequences or as temporal organizations, but it also includes
relationships that do not exhibit the sequential characteristic. For example,
the physicist has worked out the relationships among current
strength, electromotive force, and resistance so that if the values of any
two of these variables are given, the value of the third can be determined.
In such an example, the idea of the variables being in a temporal
sequence is of little consequence. The correlational point of view
allows for causal relations between events that exhibit no sequential
arrangement.
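
The physicist's relationship mentioned here is commonly written as Ohm's law, E = I × R. A minimal sketch, assuming that form and using hypothetical function and parameter names, shows how any one of the three variables follows from the other two without reference to temporal order.

```python
def solve_ohm(emf=None, current=None, resistance=None):
    """Given exactly two of the three quantities, return the third.

    Assumes Ohm's law: emf (volts) = current (amperes) * resistance (ohms).
    """
    supplied = sum(value is not None for value in (emf, current, resistance))
    if supplied != 2:
        raise ValueError("supply exactly two of emf, current, resistance")
    if emf is None:
        return current * resistance
    if current is None:
        if resistance == 0:
            raise ValueError("current is undefined for zero resistance")
        return emf / resistance
    # Only resistance remains unknown at this point.
    if current == 0:
        raise ValueError("resistance is undefined for zero current")
    return emf / current


# Hypothetical values: each call recovers the quantity that was left out.
volts = solve_ohm(current=2.0, resistance=6.0)
amperes = solve_ohm(emf=12.0, resistance=6.0)
ohms = solve_ohm(emf=12.0, current=2.0)
```

The function treats the three variables symmetrically: none is singled out as the antecedent "cause" of the others, which is the point of the example in the text.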

Correlation and the Concepts of Production and Force. Although the
ideas of production and force were essential in the mechanistic interpre- 
tation of causality, they are not important in the correlational interpreta- 
tion. These concepts are considered as inferences the empirical equiva- 
lents of which cannot be observed among natural phenomena. Correla- 
tional relationships do not manifest some peculiar power under which 
nature is compelled to follow a set form in which the cause is the gen- 
erator of the effect. So-called causal relationships are simply descriptions 
of the ways in which nature is observed to be ordered. Event processes 
are observed to be related, and these relations can be adequately repre- 
sented or expressed in the form of correlations. 


EXAMINATION OF THE CAUSAL CONCEPT IN TERMS OF THE
DATA OF EXPERIENCE 


That we can manipulate event sequences in order to attain certain ends 
is a fact that cannot be controverted. Disagreement and controversy arise, 
however, concerning the way in which the manipulation of certain given 
factors brings about a desired end in some other factors. The question 
is: How does one event sequence influence another? We have presented 
three general interpretations purporting to answer this question. Most 
laymen accept the mechanistic viewpoint, but there are a
number of them who still utilize causal concepts in a way that is remi- 
niscent of the older animistic interpretations. The correlational point of 
view appeals to most scientists, although, again, many scientists use causal 
concepts in the mechanistic sense. With disagreement still prevalent ...
actual facts of experience ...

... of a Necessary Connection ... point of view concerning ...
of experiences. We then react to these artificially abstracted phases of a
continuous process as if they are observable static occurrences. Actually, 
a given event process flows gradually from its predecessors and, in turn, 
it melts gradually into the event processes which follow it. 

With event sequences dynamically interrelated in primary experience, 
there is no problem of tying events together by causality. We are not 
dealing with static events that we must “glue” together with some mys- 
terious “causal tie.” The problem of how a causal event brings about its 
effectual event is not a valid problem. It is not necessary, then, to explain 
the meanings of such concepts as production, generation, and transmis- 
sion of force, which are advocated to account for this fictitious tie by 
which an effect is bound to its cause. 

Generalizing the Concept of Causality. The Principle of Causality. 
This principle is inferred from the relationships observed among specific 
event sequences. It stipulates that causal laws are applicable to all natural 
phenomena. We learn from experience that when certain conditions are 
present, certain events occur. From this finding, we incorrectly generalize 
that whenever these prior conditions are present the particular events 
“must” follow. Sometimes the phrase “will always” is substituted for the 
word “must.” One terse way in which this generalization is frequently 
stated is: Same cause, same effect. If, as in the mechanistic interpretation, 
the cause generates or enforces the effect, then in order to be consistent 
we must conclude that a given cause must always give rise to the same 
effect. 

Although repetition of the same sequences of events is basic to the 
establishment of this generalization, mere regularity in the occurrences 
of the sequences does not express the full meaning intended by the prin- 
ciple. Regularity is interpreted as a sign of some more fundamental con- 
nection or intimacy that exists between a cause and its effect. This in- 
timacy is referred to as a characteristic of the causal event, but it is 
abstracted from the specific temporal and spatial characteristics of the 
event and generalized to all future occurrences of the event. Thus, having 
determined the cause of a particular event, we conclude that we have discovered
a fundamental and intimate connection that will hold true for
the entire class of events of which our specific event is a member. Our
generalization then carries the certainty of this connection to all future
occurrences of events of this class.

Generalizing Relationships from Primary Experience. Let us again 
examine primary experience and seek empirical evidence for generalizing 
the causal concept. From practical experience we learn to expect the 
recurrence in the future of the patterns and relations we have discovered 
among event processes. We readily learn that certain actions need to be 
performed in order to accomplish certain ends. Experiences are carried 
over from the past and duplicated in the present. Memory elicits in us
anticipations concerning the future. We count on the patterns of past 
events occurring again and operating in the future as we have known 
them to operate in the past. We do not expect exact duplications of these 
events in all of their spatial and temporal relationships, but experience 
gives us considerable assurance that for many events we can often achieve 
close approximations to many of their temporal, spatial, and other rela- 
tionships. 

We may say that given certain antecedent organizations of event 
processes we may expect the occurrence of certain consequent event 
processes. This is not to say that, given similar situations, similar consequences
“must” or “will” follow. Rather, it means that within given similar
situations we can expect to find similar event processes operating in
similar ways. What we observe when we closely examine a so-called
cause-and-effect sequence is regularity of occurrence. This is the only
empirical meaning that we can find. Thus, we are dealing with complexes
of event processes manifesting definite dynamic relationships that are
found to recur frequently in nature.

Our generalizations for the future are based on the
continuance of the regularities already discovered ...

The use of the concept of causality by the scientist must involve the
particulars of his empirical studies. The scientist is interested in describ- 
ing nature through observation. He seeks the solution to pragmatic ques- 
tions through empirical studies and experiments. The beginning and end 
of science are found in the observable relationships among phenomena. 
Through his studies, he seeks knowledge that directly or indirectly helps 
to predict the occurrence of a given event process through the manipula- 
tion of antecedent and coexistent event processes related to it. What the 
philosopher calls the “real” meaning or the “ultimate basis” of causality 
is not of primary concern to the scientist. 


SOME MEANINGS OF FUNCTIONAL RELATIONSHIPS 


A Definition of Functional Relationship. Let us begin with a general 
definition of a functional relationship and wait for later sections for an 
explanation of the detailed meanings. A functional relationship is a rela- 
tionship among given event processes expressible as a mathematical 
proposition that provides the basis for a prediction of subsequent in- 
stances of the relationship. Any event process occurs in relation with 
many other event processes, and it is through these relationships that 
we are able to assign meanings to the particular event process that we are 
endeavoring to understand. These relationships are dynamic, not static. 
They form the larger organization of event sequences wherein we can 
observe the genetic history of the given event process. 

Functional Relationships Rest on the Uniformity of Nature. The use 
that can be made of any functional relationship rests primarily on its 
application to subsequent situations. Correlations between event proc- 
esses hold not only for the relational sequences that have been observed 
but are presumed to hold as well for relational sequences yet to be ob- 
served. Correlations evolved in past experience gain in value in propor- 
tion to the extent to which they provide us with means for dealing with 
future situations. As scientists, we are forced to accept the fundamental 
postulate of the uniformity of natural processes upon which these expec- 
tations rest. An overwhelming amount of our experience leads us to be- 
lieve that the universe is not ruled by caprice, but that all events are 
relational in nature. We must count on functional relationships operating 
in the future much as we have found them to operate in the past. We 
must trust that the organization and categorical relations we discover 
are enduring characteristics of the event processes of nature. 

The Functional Interpretation and the Concept of Cause. Some 
Similarities and Differences. Both functional relationships and so-called 
causal sequences are founded on the postulate of a determinate nature. 
In each instance, the concept has been induced from observations of the 
regularity of temporal sequences between event processes. In the use of 
the concept of causality man has imposed a special kind of order upon 
nature, as shown by such statements as “Every cause has its effect,” and 
“Given the same cause, the same effect must occur.” Accordingly, nature 
“must” or “will” follow a certain prescribed pattern of organization. 

The functional interpretation differs from causality in that it quantita- 
tively describes the relationships observed and applies the particular 
function discovered to—but does not impose it upon—subsequent natural 
processes. The function is interpreted as merely describing how event 
processes might be related in the future if and when they occur again. 
The functional interpretation emphasizes the need of checking any given 
relationship to determine if and how it will occur in the future. 

Let us contrast the two interpretations through an example. We know 
that there is a rather high relationship between reading ability and 
college grade-point average. This could be interpreted to mean that 
reading ability is the cause of college grade points, and that whenever 
the reading ability is high the college grade-point average must be high 
and whenever the reading ability is low the grade-point average must be 
low. According to the functional interpretation, the correlation indicates 
a high statistical probability that a high reading ability will be associated 
with a high grade-point average, but it does not require that this always 
be true. Rather, it invites the use of this correlational function in empirical 
situations to determine just what degree of confidence can be placed in it 
as a predictive tool. 
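
The functional interpretation of this example can be sketched in a few lines of Python. The scores below are invented purely for illustration, and the rule tested (a reading score above the group mean goes with a grade-point average above the group mean) is our own simplifying assumption, not a procedure taken from any particular study. The sketch simply counts how often the expected association holds.

```python
# Hypothetical (reading score, grade-point average) pairs for eight students.
# All values are invented for illustration only.
students = [
    (42, 2.1), (55, 2.6), (61, 3.0), (48, 3.1),
    (70, 3.4), (66, 2.8), (52, 2.4), (75, 3.6),
]

def hit_rate(cases):
    """Share of cases where reading and GPA fall on the same side of their means."""
    n = len(cases)
    mean_reading = sum(r for r, _ in cases) / n
    mean_gpa = sum(g for _, g in cases) / n
    hits = sum((r > mean_reading) == (g > mean_gpa) for r, g in cases)
    return hits / n

rate = hit_rate(students)
print(f"the expected association held in {rate:.0%} of cases")
```

The association holds for most of these invented cases but not for all of them, which is the sense in which the function describes a probability rather than a necessity.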


The Meaning of Dependency in Functional Relationships. Many scientists 
have rejected the concept of cause because they have failed to discover 
any empirical meanings by which agreement can be reached on the 
significance of the concept. In its place they have substituted the terms 
dependent and determined. Scientific experiments demonstrate that the 
occurrence of a given variable depends upon certain specific occurrences 
and conditions. Implied in the terms dependent and determined 
are the meanings of predictable and calculable. The scientist is 
willing to say that a given event process depends upon or is determined 
by other processes if he can successfully predict the occurrence of this 
process from his knowledge of its relationships with the others. He de- 
scribes those conditions among the other processes that must be met in 
order to expect the occurrence of the given event process. The meaning 
of calculable is similar to that of predictable. The scientist is willing to 
say that B is dependent on or is determined by A if through his func- 
tional equation he can calculate the value of B when given a certain 
value of A. Functional equations will be discussed in later sections. 
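
As a minimal sketch of what "calculable" means here, suppose the functional equation is a straight line, B = c0 + c1 * A, whose constants are estimated by least squares. Both the straight-line form and the paired observations below are assumptions made only for illustration; the names `fit_line` and `predict_b` are hypothetical.

```python
def fit_line(a_values, b_values):
    """Least-squares estimates of the intercept and slope in B = c0 + c1 * A."""
    n = len(a_values)
    mean_a = sum(a_values) / n
    mean_b = sum(b_values) / n
    sxy = sum((a - mean_a) * (b - mean_b) for a, b in zip(a_values, b_values))
    sxx = sum((a - mean_a) ** 2 for a in a_values)
    slope = sxy / sxx
    return mean_b - slope * mean_a, slope

# Hypothetical paired observations of A and B (invented for illustration).
A = [1, 2, 3, 4, 5]
B = [3, 5, 7, 9, 11]

intercept, slope = fit_line(A, B)

def predict_b(a):
    """Calculate the value of B expected for a given value of A."""
    return intercept + slope * a

print(predict_b(6))
```

In this sense B is said to be determined by A: given a value of A, the equation yields a value of B that can then be checked against observation.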

Functional and Nonfunctional Relationships. Contextual Relationships. 
Any given event process which we desire to study will be related with 
many other event processes. It will be found functioning in a context of 
interacting variables in which the relationships will be both varied and 
intricate. We shall find many of these relationships of little consequence. 
Those of greatest concern will be the relationships bearing directly on the 
special problem we are endeavoring to solve. To control and predict our 
event process requires the manipulation of many other related variables. 
We shall find it difficult to isolate simple sequences of event processes 
for experimentation. 

Nonfunctional Relationships. For a given event process, some of the 
relationships in which it occurs will be found to be nonfunctional in 
nature; that is, they can be classified as nondependent or nondeterminate. 
We must be able to distinguish functional relationships from nonfunc- 
tional ones. This is done by the use of the “rule of exceptions.” If in a 
relationship A → B, B frequently occurs when we have evidence that A 
has not occurred, or if A frequently occurs without being followed by B, 
the relationship would be suspected of being nondeterminate. 
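
The rule of exceptions lends itself to a simple tally. The following Python sketch counts both kinds of exception in a hypothetical record of observations. The data are invented, and the cutoff of 0.25 standing in for "frequently" is an arbitrary assumption of ours, since the rule itself names no figure.

```python
# Hypothetical record of paired observations: (A occurred, B occurred).
observations = [
    (True, True), (True, True), (True, False), (False, True),
    (True, True), (False, False), (False, True), (True, False),
    (False, False), (True, True),
]

def exception_rates(pairs):
    """Return (share of B occurrences without A, share of A occurrences without B)."""
    b_total = sum(1 for a, b in pairs if b)
    a_total = sum(1 for a, b in pairs if a)
    b_without_a = sum(1 for a, b in pairs if b and not a)
    a_without_b = sum(1 for a, b in pairs if a and not b)
    return b_without_a / b_total, a_without_b / a_total

def is_suspect(pairs, threshold=0.25):
    """Flag the relationship as possibly nondeterminate.

    The threshold is an arbitrary assumption; the rule says only "frequently".
    """
    b_rate, a_rate = exception_rates(pairs)
    return b_rate > threshold or a_rate > threshold

print(exception_rates(observations), is_suspect(observations))
```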

There is an additional check we can make. Through rational compari- 
sons we can discover incongruities or discrepancies in the nature of the 
event sequences being related which will lead us to regard them as non- 
functional. For example, night following day is a temporal sequence 
without exception, at least in areas distant from the poles of the earth. 
We do not, however, consider this sequence a functional relationship. 
The darkness of night is a function of certain particular relationships 
between the earth and the sun, but it is not a function of the day that pre- 
cedes it. 

The Certainty of Functional Determination. Successful prediction rests 
upon duplicating the past in the future. This duplication, in turn, is a 
function of the accuracy with which we are able to describe the determi- 
nants of the event process being predicted. In neither of these activities 
are we likely to be completely successful. Only if the event process is ex- 
tremely simple can we successfully describe most of its determinants. 
Having accomplished this, we can never be assured that the future will 
not present variations from the special conditions upon which our pre- 
diction rests. 

When event sequences are not simple in nature, we must be reconciled 
to error, both in our description of the determinants and in our predic- 
tion of the future functioning of these determinants. The meanings of 
functional relationships are established in empirical situations and are 
subject to the changes that these empirical situations may undergo. We 
are always in the process of making them more and more precise, but 
never can we make them completely stable. It follows that our strength 
of belief in the validity of a function, even when it is supported by 
dramatic verifications, should not prevent us from anticipating possible 
changes that would invalidate the function. 


The Empirical Verification of Functional Relationships. The dependability 
and predictive effectiveness of a functional relationship is known as 
its precision. In the process of verification, the scientist learns how dependable 
the functional relationship is and how accurately it can be 
expected to predict under a variable set of conditions. 


THE QUANTITATIVE NATURE OF FUNCTIONAL RELATIONSHIPS 


The most accurate description is attained when the different amounts 
of expression of a variable are represented by means of numbers. Func- 
tional relationships involve the changes in amount of one variable asso- 
ciated with the changes in amount of one or more other variables. A 
functional relationship is quantitatively described when it is expressed 
as the correlation between the amounts of the associated variables. 

The Use of Mathematical Equations. Describing variables by means of 
numbers makes possible the use of mathematical equations for represent- 
ing functional relationships. Such equations are of particular service to 
the scientist because they can be used to describe both the kind and 
the amount of the relationship. Despite the tremendous variety of func- 
tional relationships discovered among natural events, it is possible to de- 
vise a mathematical formula for representing every type of relationship. 
Two general classes of relation are the rectilinear and the curvilinear. In 
the first the function can be represented by a straight line, in the second 
by some type of curved line. In psychology, the relation between abilities, 
such as between arithmetic and reasoning abilities, is usually rectilinear. 
This means that for a given increase in value of one of the variables there 
is found to occur an increase of a constant amount in the other variable. 
The relation between improvement and practice is usually curvilinear, 
e.g., the learning curve for poetry. In the learning situation, a given in- 
crease in value of the practice variable is associated with decreasing 
amounts of increase in the achievement variable. 
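
The difference between the two classes of relation can be seen by comparing successive increments. In the Python sketch below, the straight line uses arbitrary constants, and the learning curve is assumed to follow a negatively accelerated exponential form. That form is only one of several curves that show diminishing gains, and it is chosen here for convenience, not because the text prescribes it.

```python
from math import exp

def rectilinear(x, intercept=10.0, slope=2.0):
    """Straight-line relation: each unit of x adds the same amount to y."""
    return intercept + slope * x

def learning_curve(trials, ceiling=100.0, rate=0.3):
    """Assumed negatively accelerated curve: gains shrink as practice accumulates."""
    return ceiling * (1.0 - exp(-rate * trials))

def increments(func, xs):
    """Change in func between successive values of xs."""
    return [func(b) - func(a) for a, b in zip(xs, xs[1:])]

xs = list(range(6))
linear_gains = increments(rectilinear, xs)
practice_gains = increments(learning_curve, xs)

print(linear_gains)
print([round(g, 1) for g in practice_gains])
```

The first list of gains is constant, while each gain in the second list is smaller than the one before it.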

In addition to depicting the kind of relationship, a mathematical equa- 
tion also expresses the extent or amount of the relationship. For example, 
there is a higher degree of relationship between the ability to add and 
the ability to multiply than there is between the ability to add and the 
ability to reason. If we quantified each relationship by means of some 
index like the coefficient of correlation, thus expressing the degree of re- 
lationship between each pair of variables, we would find the index be- 
tween the ability to add and the ability to multiply to have a higher 
value than the index between the ability to add and the ability to reason. 
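
A short computation may make the comparison concrete. The sketch below computes the product-moment coefficient of correlation for two pairs of abilities. The six sets of test scores are hypothetical, constructed only so that adding and multiplying are more closely related than adding and reasoning.

```python
from math import sqrt

def pearson_r(x, y):
    """Product-moment coefficient of correlation between two equal-length lists."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)

# Hypothetical test scores for six persons (invented for illustration).
adding = [10, 12, 15, 18, 20, 22]
multiplying = [11, 13, 14, 19, 21, 22]
reasoning = [14, 10, 18, 12, 20, 16]

r_add_mult = pearson_r(adding, multiplying)
r_add_reason = pearson_r(adding, reasoning)

print(f"add/multiply r = {r_add_mult:.2f}, add/reason r = {r_add_reason:.2f}")
```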

The Close Relationship between Mathematical Theory and Empirical 
Knowledge. Science does not tolerate a substitution of theory for the 
facts of experience. Either the theory expresses meanings that are ap- 
plicable to event processes in nature or it merely expresses the relation- 
ships of the symbols which comprise it. If the latter, then the theory is 
sterile as a procedural tool of science. 

In expressing functional relationships in the form of abstract mathe- 
matical equations, it is important that the meanings of these equations 
and any statistical manipulation we perform on them be continually referred 
back to the natural event processes from which the equations originally 
were derived. To be useful, the meanings of any formal set of 
numerical propositions must be applicable to the corresponding set 
of determinant event processes. There is nothing inherent in the logic of 
mathematics, however, that enables us to determine the applicability of a 
mathematical or numerical meaning to a natural process. The possibility 
always exists that the mathematical operations we use on our numbers 
are not applicable to the empirical facts. The meanings derived from 
an equation are then not necessarily assignable to the event processes 
under study. 

Mathematical equations have been demonstrated to have great power 
for discovering new meanings for the sciences. The mathematical manipulation 
of quantities affords us a tremendous leverage over the unknown 
because of the very abstract nature of numerical symbols. The fact that 
we cannot compel empirical processes to jump through the hoop of the 
calculus does not mean that this mathematical tool is useless for deriving 
meanings for our functional relationships. It is necessary, however, once 
we have derived meanings from a mathematical equation, to justify the 
application of these meanings to the empirical determinant processes to 
which we desire to assign them. 

Mathematical Equations and Scientific Laws. Mathematical equations 
make possible the most accurate statement of scientific laws. A scientific 
law is a functional relationship that has high predictive value in a wide 
variety of situations. Such laws have their beginning in empirically ob- 
served relationships among natural event processes. To be usable under 
a variety of conditions, a functional relationship must be abstracted from 
many of the specific contexts in which it occurs. By repeated testing of the 
function under a variety of conditions, it is possible to learn in what way 
and to what degree the function can be described without reference to 
particular specific conditions. Gradually, the function can be generalized 
into a law. This law merely refers to the statistical regularity of the 
functional relationship. Mathematical equations are particularly appropriate 
for representing a generalized functional relationship because mathematical 
symbols can be assigned whatever meanings are required to represent 
the function at that level of generality where it will have high 
predictive value. 


THE STUDY OF FUNCTIONAL RELATIONSHIPS IN BEHAVIOR 


The Complexity of Psychological Determinants. The most consistent 
characteristic of human behavior is the complexity of its determinants. 
We must think of every behavior sequence as stemming from many 
determinants. Some of these initiate their influence at the time of the 
conception of the individual, others come into play just prior to the 
occurrence of the observed behavior, while numerous others arise in 
between these two extremes on the time axis. Behavioral processes do 
not function independently but interact with each other in many intricate 
and complex ways. Combinations of determinants act over various in- 
tervals: sometimes forming genetic series occupying hours, days, months, 
or years; at other times acting for merely a second or as a simultaneous 
flash occurrence. 

Some determinants of behavior are little understood at present, and the 
names applied to them are lacking in empirical significance. Others, how- 
ever, are accurately described and the nature of their contribution is fairly 
well understood. 

There is little justification for oversimplifying our conception of the 
determination of human behavior. Seldom can a behavior process be com- 
pletely represented as a simple response unit; nor can the determinant 
conditions be adequately analyzed into simple stimulus situations. The 
psychologist can muster little confidence in the logical reduction of be- 
havior determinants to a state of abstracted simplification expressed in 
the form of simple stimulus-response sequences. 

Narrowing the Area of Functional Relationships in a Study. One of the 
first steps we take in studying particular behavior processes is to dis- 
cover the area of functional relationships in which we think the processes 
can be found. As we have previously learned, no process is ever found 
in isolation; it is always discovered intricately interwoven with a large 
number of other processes. Primary experience consists of sequences, 
systems, totalities, and organizations of processes. From among these 
multifarious spatial and temporal contextual relationships we must discover 
those that are pertinent to the behavior process we wish to understand. 

We seek the determinants of a given behavior process among the many 
interrelationships that it exhibits with other behavior processes. To be 
able to predict the process accurately we must understand its relationships 
and be able to reconstruct these relationships. The contribution of any 
one of these relationships as a determinant will be conditioned by many 
factors. The question of just how many relationships must be reconstructed 
and just which ones of the many relationships will be required 
must be discovered by empirical investigations. 

One factor that will restrict the area of relationships needing investiga- 
tion is the nature of the problem being studied. We can disregard those 
relationships that seem not to come within the general purview of the 
problem. This, however, may not place sufficiently narrow limits on the 
known relationships to make an empirical analysis feasible. Additional 
limitation of the field for empirical investigation can be achieved by 
carefully examining the nature of the relationships and selecting those 
sequences in which, from the standpoint of logic, there appears to be a 
close determinant relation. Further restriction can be obtained by focusing 
our attention on systems of relationships that are temporally and 
spatially close to the behavior process under study. The more closely 
contiguous in time and space the determinant is to the process, the 
greater its contribution is likely to be. 

An Example. Let us illustrate some of the foregoing points with a problem 
concerned with the determinants of an automobile accident. Suppose 
that Mr. Smith, in trying to pass the car ahead of him, had a very harrowing 
experience. This vividly brought to his attention the dangers inherent 
in unsafe driving, and is presently stimulating him to give a great deal of 
thought to the analysis of why he did what he did. He wishes to discover 
an order of events that will explain why he had the harrowing experience. 
He surveys the situation and brings to light the following facts which 
might bear on the problem: his erratic driving may have resulted from 
his not feeling well because of a protracted head cold; his attitude about 
safe driving may have become lax from just having been delayed at an 
intersection during an unwarranted traffic snarl; he may not have been 
actively attending to the driving task because he was thinking of the 
bawling out his boss gave him at noon; his car may have been at fault 
in not accelerating fast enough as he passed the other car, etc. We will 
have for consideration several areas where determining conditions may be 
found. These are relevant behavior processes that need further study. 

All of the foregoing reasons appear to have relevancy for the near- 
accident situation, and thus they all come under the purview of the prob- 
lem. There is need, however, for restricting the variables to some degree 
in order to facilitate the discovery of an acceptable explanation. Con- 
tinuing his analysis, Mr. Smith remembers that the car was recently re- 
paired and was presumably in excellent mechanical condition, a fact 
that he had momentarily forgotten. One of the relevant areas can now be 
passed over. The area of driver attitude appears to hold promise. Further 
study reveals that Mr. Smith had been delayed at the office and was 
getting home late, that there is often a family altercation when he arrives 
late for dinner, that he was hurrying to avoid being late, that the traffic 
jam at the intersection had frustrated him and had stimulated him to 
make up time, and that not only was he going to be late but he would 
have to tell his wife about the trouble with the boss. We find in this area 
of attitude events which are logically closely associated with the near- 
accident behavior, and which are also closely contiguous to this behavior 
in both temporal and spatial contexts. 


Approximating the Determinants of a Behavior Process. The solutions 
to most psychological problems are obtained only after many empirical 
studies. The procedure is one of approximating more and more closely 
to an accurate description of the specific determinant conditions. The 
procedure to be followed bears repeating. We first must discover and 
describe those behavior systems that appear to contain important determi- 
nants. Then those systems that are relevant to our problem must be 
carefully analyzed. Next, particular behavior sequences that are logically 
pertinent to our problem must be discovered and further examined. 
Finally, these behavior sequences must be studied under empirical con- 
ditions with the expectation that behavior processes containing the solu- 
tion will be found. 

It is not necessary to achieve an accurate description of a behavior 
sequence before we hazard a judgment concerning its pertinence for our 
problem. From the first few approximation studies enough knowledge 
will be gained to justify the complete elimination of many relationships 
that at first appeared relevant. Likewise, our knowledge of some relationships 
will be sufficient to tag them as meriting more intensive examination. 

According to our findings in these early studies, we proceed to elimi- 
nate those sequences that do not contribute to our problem and to 
sharpen our description of those that contain usable functional relation- 
ships. By repeating this procedure several times we gradually improve 
our description of the determinant conditions. 

In later studies, our attention is focused on determining the relative 
importance of the several relationships still remaining. This is a difficult 
task. The contribution of any given variable is determined, in part, by 
its relationships with other variables, even relationships with those 
variables that have been eliminated earlier in the investigation. There- 
fore, as we proceed with the approximation studies it is important to 
note how the contribution of any given behavior process changes with 
the elimination and retention of other processes. 
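
One way to see how the apparent contribution of a variable shifts when another variable is eliminated is to compare least-squares weights. The Python sketch below treats the least-squares weight as a stand-in for "contribution", which is our assumption rather than a measure the text specifies. The measurements are hypothetical, and the outcome is constructed as the simple sum of two correlated variables.

```python
def cross_products(x, y):
    """Sum of cross-products of deviations from the means."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    return sum((a - mx) * (b - my) for a, b in zip(x, y))

def weight_alone(x1, y):
    """Least-squares weight of x1 when it is the only predictor retained."""
    return cross_products(x1, y) / cross_products(x1, x1)

def weight_with_other(x1, x2, y):
    """Least-squares weight of x1 when x2 is retained alongside it."""
    s11, s22 = cross_products(x1, x1), cross_products(x2, x2)
    s12 = cross_products(x1, x2)
    s1y, s2y = cross_products(x1, y), cross_products(x2, y)
    return (s22 * s1y - s12 * s2y) / (s11 * s22 - s12 ** 2)

# Hypothetical measurements; the outcome is built as x1 + x2 for illustration.
x1 = [1, 2, 3, 4, 5]
x2 = [2, 1, 4, 3, 6]
outcome = [a + b for a, b in zip(x1, x2)]

with_x2 = weight_with_other(x1, x2, outcome)
without_x2 = weight_alone(x1, outcome)

print(f"weight of x1 with x2 retained: {with_x2:.2f}")
print(f"weight of x1 with x2 eliminated: {without_x2:.2f}")
```

Because the two predictors are correlated, eliminating the second one causes the first to absorb part of its contribution, and the weight of the first rises accordingly.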

Once we have made a final selection of the behavior processes to be 
retained, further studies are required in order to assess accurately the 
functional relationships involved. This is especially true when we desire 
to abstract the relationships from the concrete characteristics of the 
empirical testing situations. The process of abstraction does not affect 
each of the several determinant relationships equally, and the differential 
effects of the abstraction must be known before an accurate generaliza- 
tion of the functional relationships can be made. 

Psychological Functions Stated as Statistical Laws. The verification of 
a psychological function in a variety of situations enables us to determine 
its predictive effectiveness as a statistical law. A statistical law states the 
expectations of successful prediction in the form of a probability statement. 
The evidence favorable to a prediction is weighed against the evidence 
unfavorable to the prediction. The statistical statement is derived 
from the successes and failures obtained in empirical verification studies. 
The functional relationship is then expressed in terms of the probability 
of making a successful prediction. 
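
In its simplest form, such a probability statement is a relative frequency. The following Python sketch assumes that each verification trial is scored only as a success or a failure and that all trials count equally. The twenty outcomes are hypothetical.

```python
def success_probability(outcomes):
    """Relative frequency of successful predictions in verification studies."""
    if not outcomes:
        raise ValueError("at least one verification outcome is required")
    return sum(outcomes) / len(outcomes)

# Hypothetical outcomes of twenty verification trials (True = prediction held).
trials = [True] * 16 + [False] * 4
p = success_probability(trials)

print(f"estimated probability of a successful prediction: {p:.2f}")
```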

The statement of probability should take account of the contribution 
of all of the known determinants. A statistical law attempts to summarize 
the findings of the past. This is very difficult to do, especially in global 
behavior situations where many determinants, varying in importance, are 
functioning. Only when the contributions of the determinants are accu- 
rately quantified, however, can a high level of precision be achieved in 
the over-all probability statement. 

Accounting for Failures. In a deterministic world, failures, like suc- 
cesses, are to be traced to specific sequences of events. The failure of a 
behavior process to occur as we predict it is frequently ascribed to chance 
or accident. This is another way of saying that we do not know why the 
prediction failed. If we compare the behavior processes occurring during 
the failures with those occurring during the successes, we frequently can 
learn about differences that are critical. We can then attribute the failures 
to a particular organization of behavior processes and do not need to 
refer them to chance or accident. 
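
The comparison of failures with successes can be sketched as a tally of which conditions were present in each group of trials. In the Python example below, the condition names (`rested`, `noise`) and all of the records are hypothetical. The sketch merely reports how much more often each condition accompanied the failures than the successes.

```python
# Hypothetical trial records: conditions present and whether the prediction held.
records = [
    ({"rested": True, "noise": False}, True),
    ({"rested": True, "noise": False}, True),
    ({"rested": True, "noise": True}, False),
    ({"rested": False, "noise": False}, True),
    ({"rested": True, "noise": True}, False),
    ({"rested": False, "noise": False}, True),
]

def condition_rates(rows, outcome):
    """Share of trials with the given outcome in which each condition was present."""
    chosen = [conds for conds, held in rows if held == outcome]
    names = chosen[0].keys()
    return {name: sum(c[name] for c in chosen) / len(chosen) for name in names}

def critical_differences(rows):
    """Difference (failures minus successes) in how often each condition was present."""
    fail = condition_rates(rows, False)
    succeed = condition_rates(rows, True)
    return {name: fail[name] - succeed[name] for name in fail}

diffs = critical_differences(records)
print(diffs)
```

A condition with a large difference is a candidate for the critical difference between failures and successes, and it can then be examined directly in further trials.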

There are two primary reasons why failures might occur. They may be 
attributed to our not reproducing all of the event sequences that our 
earlier analyses show are necessary for the prediction. They may be 
attributed to the operation of interfering or inhibitory variables that were 
never encountered in our earlier analyses and were thus not isolated and 
described. In the first instance, if we learn what the missing event se- 
quences are, we can modify our procedures where necessary and reduce 
the probability of subsequent failures. In the second instance, if we can 
discover and isolate the interfering or inhibitory variables we may be 
able to devise control procedures that will prevent their occurrence in 
subsequent predictions. 


CHAPTER 5 


The Control of Psychological Variables 


One of the most important problems facing the scientific psychologist in 
his attempt to discover the fundamental orders underlying human behavior 
is that of controlling variables. It should be obvious that if a 
relationship is to be discovered, regardless of its nature, it is necessary to 
separate the variables thought to be related from other variables, at least 
to the extent that the variables under study can be observed and measured 
independently of other variables that are not of immediate concern. If we 
are to realize the aim of prediction, it is necessary to separate and control 
the effects of the several variables considered pertinent to our prediction. 


THREE OBJECTIVES OF CONTROL 


As psychologists, we are faced with a very difficult task in the 
control of the complex of variables that condition even the most simple 
expressions of behavior. To discover the factors underlying any 
activity we must analyze the variation in numerous determiners, discover 
the nature and importance of combinations of these determiners, and, 
when possible, measure the interaction among individual determiners 
and among combinations of determiners. We seek to control variables for 
the following purposes: (1) to isolate the determiners individually and 
in combinations; (2) to vary them as magnitudes either singly or in combinations; 
and (3) to describe quantitatively the extent of their expression 
and their interacting effects, again, either as single determiners or as 
combinations of determiners. 

Control to Achieve Isolation. In its simplest form, this level of control 
is used when it is desirable to rule out or keep constant the effects of a 
variable. This would occur in the isolation and elimination of extraneous 
noises in an experiment involving a study of auditory thresholds. Isolation 
in this instance might be accomplished by performing the experiment in 
a soundproof room. 

Isolation may involve estimating the magnitudinal changes in a variable. 
For instance, sometimes in isolating a variable we need to know 
when the effect of the variable has been reduced to zero or has been 
made constant in amount. We can readily determine this if we can meas- 
ure the expressions of the variable. 

Control to Achieve Changes of Magnitude. This refers to a degree 
of control sufficient to enable us to change directly the expression of the 
variable. In most investigations we are not simply interested in learning 
if a variable has an effect on the outcome; we desire also to know how 
much effect the variable is contributing. To achieve this it is necessary 
that we be able to vary the magnitude of the pertinent variables under 
study. For example, in studying the relationship between high school 
preparation and college success we want to know how much contribution 
high school training makes to the student’s work in college. 

Control to Achieve Quantitative Evaluation. This is the highest level 
of control. We must not only know that a variable is large or small but 
must be able to express the magnitude of the variable in terms of some 
numerical value. We are not simply interested in knowing that one expression 
of a variable is larger or smaller than another; we want to know 
how much larger or smaller. We desire a quantitative statement of the 
difference. Similarly, we are not simply interested in knowing that two 
variables are functionally related, either positively or negatively; we 
want to know the extent or the amount of the relationship. We desire a 
numerical estimation that will indicate at what point the relation falls 
on a continuum that varies from zero relation at the one end to perfect 
relation at the other. 

A majority of the variables in psychology are fundamentally continuous 
in nature or can be considered to operate as if they were continuous 
in nature, and therefore we can assume they are measurable. Their 
complexity, however, makes the problem of measurement difficult. Much 
of the time and effort of the psychologist is spent in evolving measuring 
devices by which he can obtain the precision of control that will enable 
him to make quantitative evaluations of his variables and their relationships. 


A STUDY ILLUSTRATING PROBLEMS OF CONTROL 


The Problem to Be Investigated. The project now to be described 
arose from practical problems involved in driving an automobile at night. 
One form of visual response that has bearing on night-driving performance 
is called glare blindness. On the highway at night the driver faces the 
bright headlights of oncoming cars. If he fixates on the highway ahead of 
his car, which he must do to drive safely, he cannot escape this source 
of intermittent bright stimulation. The eyes are forced to adapt alternately 
to brightness and dimness of illumination. Glare blindness refers to the 
experience immediately following the bright headlight stimulation when 
the driver's eyes are not dark-adapted, and when, as a consequence, he 
cannot accurately discriminate the objects along the road. This temporary 
blindness from glare stimulation disappears when his eyes have had 
time to adapt to the dimmer light conditions of the road ahead.

For many years it has been known that vitamin A is associated with 
the retina’s power to adapt to dark conditions of illumination. Increased 
amounts of vitamin A in the retina are associated with higher rates of 
recovery from glare stimulation. The question arises as to whether changes 
in the vitamin A intake through diet regulation would have any effect 
upon the driver's ability to withstand the glare of oncoming headlights. 
The question selected for study was framed as follows: Do changes in 
vitamin A intake through regulation of the diet affect the individual's rate 
of recovery from exposure to bright light stimulation? 

The Principal Experimental Procedures. College students were used as 
subjects. Variation in the dietary intake of vitamin A was accomplished 
by having the students go on a special dietary regimen: three weeks on a 
normal diet, three weeks on a diet high in vitamin A content, and three 
weeks on a diet low in vitamin A. 

The response of the eyes was measured by means of a glare recovery 
test. This consisted of a lighttight box 12 by 12 by 18 inches in size. A
headlamp of an automobile was placed at one end of the box behind
an opalescent glass plate. At the opposite end there was a viewing aper-
ture through which the subject could perceive the bright light. In front
of the glass plate was a test object in the form of an arrow. This object 
was withdrawn from view during the bright light stimulation and re- 
turned to the field of vision at the instant the light was turned off. The 
arrow could be rotated in the four quadrants of space as seen by the 
subject. 

In the testing situation the subject held his head tightly against the 
edge of the visor of the viewing aperture. The bright light was turned
on for 20 seconds. The subject then reported the direction of the arrow 
as soon as he could see it. The arrow was rotated to a second and again to 
a third position, and the subject reported the direction he thought it 
pointed to at each setting. The time was recorded from the instant that 
the bright light was turned off to the instant the subject correctly reported
the position of the arrow. 


The Control of the Variables. Controlling the Diet Variable. A list of 
staple foods, high and low in vitami 
trating on foods low in vitamin A content. The divergence of the special
diets from the normal diet of the subjects was very extreme, the subjects 
frequently commenting on the difficulty of maintaining the special dietary 
regimen for as long a period as three weeks. 

Controlling Light-exposure Variables. The time of exposure of the 
subject to the bright light stimulation was held relatively constant through 
use of a stop watch. Variation in the intensity of the bright light stimula- 
tion was minimized by periodically checking the brilliance of the light by 
means of a light meter. The subject’s pretest level of light adaptation was 
made approximately constant by having him remain in the testing room 
for a period of 20 minutes preceding the administration of the test. The 
subject was cautioned during the test to keep his head tightly against 
the visor of the viewing aperture in order to minimize the infiltration 
of stray light into the testing box. 

Controlling Individual Differences. Individuals differ in respect to their 
ability to withstand glare stimulation and also in their ability to assimilate 
vitamin A. If a different group of subjects had been used in each of the 
three experimental conditions, we would have had to make sure that the 
three groups were equal in respect to these two variables. To circumvent 
this difficult problem each subject took part in each phase of the experi-
ment. Any individual differences were then introduced equally into each
of the three experimental conditions. 

Controlling Interphase Effects. It is a well-documented fact that when 
an individual participates in every phase of an experiment, the effects 
of a given phase may continue forward in time to determine in part 
what he will do in a later phase. In the glare recovery experiment, the 
effect of the low vitamin A diet might be due in part to whether the sub- 
ject had previously been on a normal diet or on a diet high in vitamin A. 
In the absence of knowledge of how one phase affects another phase, 
it is necessary to control these effects by equalizing them. This is accom-
plished by arranging the experimental conditions in several sequences
and having different subjects participate in the different sequences. In 
the glare recovery experiment, the conditions were arranged so that 
each experimental condition preceded every other condition, different 
subjects being randomly assigned to the different sequences or orders of
conditions.
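
The equalizing of interphase effects described above can be sketched in code. What follows is a minimal illustration in Python, not a record of the procedure actually used: it assumes that all six possible orders of the three diets are employed and that twelve subjects are available, neither of which is stated in the text. The function name assign_sequences and the subject labels are hypothetical.

```python
import itertools
import random
from collections import Counter

# The three dietary conditions of the glare recovery experiment.
CONDITIONS = ("normal", "high_vitamin_a", "low_vitamin_a")


def assign_sequences(subjects, conditions=CONDITIONS, seed=None):
    """Randomly assign subjects to the possible orders of the conditions.

    Assumption: every permutation of the conditions is used, so that each
    condition precedes and follows every other condition equally often
    whenever the number of subjects is a multiple of the number of orders.
    """
    rng = random.Random(seed)
    orders = list(itertools.permutations(conditions))  # 3! = 6 sequences
    shuffled = list(subjects)
    rng.shuffle(shuffled)  # random assignment of subjects to sequences
    # Deal the shuffled subjects out to the sequences in rotation.
    return {s: orders[i % len(orders)] for i, s in enumerate(shuffled)}


subjects = [f"S{n:02d}" for n in range(1, 13)]  # hypothetical subject labels
assignment = assign_sequences(subjects, seed=1)

# How many subjects follow each sequence, and how often each condition
# falls in the first, second, and third phase.
order_counts = Counter(assignment.values())
position_counts = Counter(
    (phase, condition)
    for order in assignment.values()
    for phase, condition in enumerate(order)
)
```

With twelve subjects and six sequences, each sequence is followed by two subjects and each diet falls in each phase four times, so that any carry-over from one phase to the next is distributed equally over the three conditions.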

Controlling Chance Factors. In addition to the variables mentioned 
above, there were many others operating to affect an individual's re-
sponses to light stimulation. Day-to-day fluctuations in physical effi-
ciency, changes in the room temperature and illumination due to varia-
tions in the outside weather conditions, variation in the subject's will-
ingness to cooperate in subscribing to the requirements of the experi-
mental conditions, fluctuations in the attention and in the precision of
response of the experimenters, and other such factors were operating.
It was impossible to eliminate all of these factors, but an attempt was 
made to equalize their effects. This was done by arranging the experi- 
mental procedure so that each factor operated equally in every phase 
of the experiment. For example, in the experimental situation, every 
experimenter followed the same testing procedures, participated in test-
ing every subject, and operated the test in each of the experimental
conditions. Thus, any variation attributable to experimenters tended to 
be distributed in all phases of the testing. 

The Experimental Findings. Although we are not here particularly 
concerned with the experimental results, the reader may be interested in 
the outcome of the experiment. Significant differences in the time of 
recovery from glare stimulation were found for the three conditions. The 
fastest recovery was achieved under the high vitamin A condition, the 
slowest under the low vitamin A condition. This was true for all subjects.
Responses under the normal diet were never faster than under the high
diet, but were not always faster than under the low diet. The high-
vitamin-diet regimen was much more severe than one would want to
undertake for any protracted length of time, indicating that supplemen- 
tation by vitamin capsules would probably prove less objectionable than 
supplementation by diet regulation. 


COMMON VARIABLES NEEDING CONTROL IN 
PSYCHOLOGICAL INVESTIGATIONS 


Some General Areas of Behavior Determiners. Any factor that func- 
tions as a determiner of behavior at some time may require control. 
Below are listed some general areas of determinants of human behavior,
and within each area some subareas are given. No attempt is made to
be exhaustive of the possible areas or of the subareas within any area.
The areas are sufficiently general in nature so that determiners from
most of them will play a role in nearly every study in which human
subjects are used. The purpose for listing the variables at this point is to
remind ourselves of the breadth and kind of factors for which controls
are needed, and to add some concreteness of meaning to our thinking
about problems of control.


Examples of General Areas and Subareas of Determiners 


SCHOOLING 


Incentive to work with books 
Level of success in school—general average 
Level of success in school—in different subjects 
Amount of training received—estimate of over-all average
Amount of training received—averages for specific subjects
Specific subjects liked
Specific subjects disliked
Speed of work in different subjects
Accuracy of work in different subjects
Specialized training received, e.g., skilled trades


SKILLS 


Sports played—kinds and amounts 
Hobbies practiced—kinds and extent 
Musical instruments played—extent of skill 
Mechanical skills—kinds and extent 
Physical deficiencies affecting skills 


FACTORS RELATED TO MATURITY 


Chronological age
Physiological maturity
Psychological maturity—interests, drives, emotions
Amount of experience in special areas of development


CULTURAL FACTORS 

Exposure to foreign language 

Exposure to foreign culture and ideologies 
Degree of assimilation of American culture 
Exposure to particular regional cultural patterns 


SOCIAL EXPERIENCE 
Preferred social activities 
Social activities disliked 
Participation in social activities at school—kinds and extent
Participation in social activities at home—kinds and extent
Participation in group sports—kinds and extent
Participation in group hobbies—kinds and extent
Social activities connected with vocational interests 


PHYSIOLOGICAL FACTORS 


Physiological development 
Emotional development
General physical well-being
Specific physical impairments
Susceptibility to particular diseases
Level of energy output


Some Examples of Specific Behavior Determiners. In conducting a
particular study, more specific factors will be encountered than those
considered above. These will vary widely with the nature of the problem
being investigated. Suppose the study involves the administration of a
group intelligence test; then specific factors would have to be considered
in connection with the subjects taking the test, with the test itself, and
with the testing procedures. A few examples of such specific factors
follow:


Subject’s degree of cooperation 

Subject’s anxiety relative to rate of work 
Validity of test items 

Adequacy of test instructions 

Accuracy of norms and standards 
Adherence to testing procedures 
Prevention of distractions during the test 
Accuracy of scoring of test responses 


PROCEDURES FOR ACHIEVING CONTROL OF VARIABLES 


Over many years of research, scientists have developed and adapted 
control procedures to meet a large variety of specific problems. It is a
difficult task to find a simple classification scheme into which all of the
individual procedures can be fitted. Three categories are selected that
are sufficiently comprehensive to encompass all of the many diverse kinds
of control procedures available today. According to the nature of the
individual procedure applied to the variable, control can be achieved
through physical manipulation, through procedures of selection, and
through statistical manipulation.


Control by Some Form of Physical Manipulation. This refers to pro-
cedures in which there is a more or less direct manipulation of the deter-
miner itself or the immediate conditions that give rise to the determiner.

Mechanical Means. One form of manipulation comprises mechanical
methods. Here we may cite the familiar apparatus controls of the labora-
tory, such as the exposure drums for presenting memory materials, the
insulating materials for soundproofing rooms, the tachistoscopes for
exposing perceptual stimuli, and the problem boxes and mazes for meas-
uring learning responses.

Electrical Means. A second form of physical manipulation utilizes
electrical means to effect the control. This procedure has very wide
application in the generation of sounds in experiments in hearing and
in the use of telechron and other constant speed motors for driving 
apparatus, controlling relays, and measuring time intervals. 

Surgical Means. The use of operative surgery also effects a direct con- 
trol over the physical mechanisms determining behavior. Experiments 
on the brain are a case in point. Another would be the surgical removal 
of glands, such as the thyroid or the adrenals. Experiments on gonadec- 
tomized animals are an application of this procedure. 

Pharmacological Means. A fourth type of procedure uses drugs, change 
of diet, feeding of gland extracts, etc., to effect control over certain bio- 
chemical determiners of behavior. Studies of the use of dilantin in the 
treatment of epilepsy and the use of pentobarbital in the release of the 
repressed fears of veterans suffering battle fatigue are illustrative of 
the control that the use of drugs can effect.

Reasons for Not Using Physical Manipulation. Control through the
physical manipulation of the variable itself or its immediate determiners 
may not be utilized for one of three reasons; namely, physical manipu- 
lation may be undesirable, it may be difficult to achieve, or it may be 
impossible to achieve with available techniques. 

Obviously there are regions of human behavior, such as those of sex 
and inheritance, where we are woefully in need of additional knowl- 
edge, but in which direct physical manipulation of the variables is ruled 
out. 

There are problems in social behavior in which manipulation of the 
variables is possible but very difficult for an individual investigator to
achieve. An example is the study of the effects of physiological growth on
the social behavior of children. Such a problem would require the genetic
study of many children over several age levels, and would not only in-
volve a long-term study but would be a highly expensive undertaking.
Most individual investigators would not command the facilities required
to undertake such an investigation.

Some variables are not amenable to control through physical manipu- 
lation. The variable of age is one; the variable of past experience, for the 
most part, is another. In studies of learning, variations in the past experi- 
ence of the subjects are always present, but they can seldom be directly 
manipulated. We cannot equate the past experience of different subjects 
merely by some kind of condensed course of training in which each in- 
dividual is schooled in the types of experience that the other individuals 
possess but that he lacks.

The Need for Other Methods of Control. Although physical manipula-
tion of variables has played the primary role in the development of
control procedures in the physical sciences, this method leaves unsolved
a large number of the control problems of the psychological and social 
sciences. We must remember, however, that it is not necessary to be
able to manipulate our variables physically in order to study them
scientifically. Astronomy is a good example of a science that lacks the
power of physical manipulation over most of the variables with which it
is concerned. The psychologist similarly finds himself without
manipulatory control in many behavior situations involving the develop-
ment of the individual. It is therefore necessary for him to effect and
utilize other forms of control procedures.

Control by Procedures of Selection. The Importance of Control through
Selection. Control by means of selection proves of great value in psy-
chology because it can be applied to so many different phases of a
scientific investigation. Variables which would otherwise go uncontrolled
have been subjected to rigorous indirect manipulation, which can in
no way be considered less effective than the procedures of direct physi-
cal manipulation. Control through selection has enabled the psychologist
to study composites of determiners as they function in global behavior, a
problem in control that the method of physical manipulation is ill
adapted to solve.

The Selection of Materials. An important means of effecting control
over psychological variables is through the selection of experimental
materials. The field of learning is replete with control problems solved
by this procedure. For example, in studying the relation between the
amount of material to be learned and the time required for learning,
the investigator must use a large number of units of material that are
comparable in terms of the ease of learning. The several conditions of
the experiment require that differing amounts of material be learned.
Of course, there can be no duplication of materials in the several condi-
tions. Obviously, once a subject learns the material of one of the condi-
tions, that material cannot be used again in any of the other conditions.
There then is introduced the possibility that a spurious factor may
affect the speed of learning, namely, differences in the difficulty of the
material utilized in the several experimental conditions.

Referring to our example, suppose that our study involves a compari-
son of the amount of time required per syllable to learn lists of 8, 10, 12,
and 14 syllables. If the 8-syllable list contains more difficult
units than the 10-syllable list, more time for learning might be required
for the 8-syllable list. Difficulty of material is then a factor
affecting the relationship between amount of material and time required
for learning. By means of a careful evaluation of the difficulty of the
units of material and a selection for utilization of only those units that
are comparable in difficulty, the spurious factor is minimized as a deter-
miner of the learning time.

The Selection of Subjects. In the selection of the particular individuals
to act as subjects in a study, we can manipulate indirectly many of the 
variables that are beyond direct physical manipulation, such as past 
experience, age, inherited dispositions, and the like. Suppose, as di- 
rectors of operations in a factory, we are interested in studying the rela- 
tive effectiveness of two different work procedures. We wish to investi- 
gate the relative effectiveness of two methods of operating a lathe when 
the lathe experience of the workers is held constant. If differences in the 
amount of lathe-operating experience were allowed to enter the experi- 
ment, we would not be justified in attributing the results to differences 
in the methods of work. We gain control over past lathe-operating ex- 
perience by selecting workers who have equal amounts of such experi- 
ence. We can arrange the workers in pairs having equal experience and 
then randomly assign a worker of each pair to each of the experimental 
conditions. In a similar way, such global factors as ability, interest, atti-
tude, etc., can be brought under control.
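
The matching procedure just described can be sketched as follows. This is a minimal illustration in Python under stated assumptions: the worker labels, the months of experience, and the function name matched_pair_assignment are all hypothetical, and workers adjacent in rank of experience are treated as a matched pair.

```python
import random


def matched_pair_assignment(experience, seed=None):
    """Pair workers of like experience, then split each pair at random.

    experience: dict mapping a worker label to months of lathe experience.
    Workers are ranked by experience and taken two at a time, so each pair
    is as nearly equal in experience as the sample allows. If the number
    of workers is odd, the last worker is left unassigned.
    """
    rng = random.Random(seed)
    ranked = sorted(experience, key=experience.get)
    groups = {"method_a": [], "method_b": []}
    for i in range(0, len(ranked) - 1, 2):
        pair = [ranked[i], ranked[i + 1]]
        rng.shuffle(pair)  # chance decides which member gets which method
        groups["method_a"].append(pair[0])
        groups["method_b"].append(pair[1])
    return groups


# Hypothetical months of lathe-operating experience for six workers.
experience = {"W1": 6, "W2": 6, "W3": 12, "W4": 12, "W5": 24, "W6": 24}
groups = matched_pair_assignment(experience, seed=7)

total_a = sum(experience[w] for w in groups["method_a"])
total_b = sum(experience[w] for w in groups["method_b"])
```

Because each pair is equal in experience, the two groups have the same total experience whichever way chance splits the pairs, and any difference in output can then be attributed to the method of work rather than to experience.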

The Selection of Data. A further application of selection procedures 
is found in the selection of the data to be analyzed. When this method 
is used, the problem to be investigated is so designed that data already 
available can be utilized in discovering possible solutions. This is exem- 
plified in some of the problems of social psychology where such primary 
sources of data as the records of public institutions, various collections 
of vital statistics, government census reports, and the like, may provide 
an investigator with the facts he needs for studying certain social factors 
contributing to behavior. Similarly, this procedure may be used when
the behavior to be studied is controlled in some manner by a federal or
state institution, as in a reform school or a state prison. Here we might
not be allowed to interfere with the behavior routine of the inmates but 
might be furnished records from which we could select data pertinent 
to our problem. 

Although the method of selection of data may give the investigator an 
excellent leverage on the global composites of behavior that are repre- 
sented by the categories used in the primary sources of the data, the 
procedure does not make possible the isolation and evaluation of every
factor that conditions the behavior being investigated. Only those factors
can be separated for study that are separately measured in the data.
For example, juvenile delinquency can be studied in reference to the
single factor of divorce of parents only if the records provide information
concerning the marriage status of the parents of the delinquents. Fur-
thermore, the quality of control that can be achieved is directly condi-
tioned by the completeness and accuracy of the records. The degree of
confidence that can be placed in the data and thus in any generalizations 
made from analyzing the data is a direct function of the completeness
and accuracy of the facts recorded.

Control by Statistical Manipulation of the Data. This procedure ex-
tends control beyond that which is obtainable through the other two
procedures. Statistical analyses, of course, are used in conjunction with
other scientific procedures, but as a method for controlling variables
they have a unique contribution to make that deserves our attention.

Statistical Control in Complex Behavior. Statistical controls are par-
ticularly adapted to the multiple variable situations found in psychology
because they enable us to discover the determiners of behavior when
these determiners are not amenable to direct physical manipulation.
Furthermore, they enable us to approximate the relative importance of
the contribution of each of several factors in the determination of an
event, a problem that neither of the other two methods has adequately
solved.

In the laboratory situation, where it is possible to isolate and manipulate
each determiner independently of the others, the tracing and evaluating
of functional relationships can readily be achieved. When we move out
of the laboratory situation to seek functional relationships in the global
behavior that characterizes everyday life, the problem is made increas-
ingly difficult. Here the techniques of physical manipulation are
limited in their applicability. The procedures of selection involving ma-
terials, subjects, and data sometimes enable us to isolate and evaluate
many of these global factors. We find, however, that these procedures
seldom are sufficient to carry us all of the way to an evaluation of the
relative significance of several global factors simultaneously functioning
in complex combinations. It is in such problems as this that statistical
control has proved of outstanding value.

An Example. The consideration of a concrete situation will assist us
in comprehending the power found in statistical procedures of control.
Let us consider the problem of predicting success in college from a
knowledge of the experience and abilities of students. College success
is a global complex of phenomena that results from the functioning of
several other global complexes of factors such as ability to read, high
school training, amount of time devoted to study, abstract intellectual
ability, and the like. Such determiners are beyond physical manipulation,
and their complex interactions are not amenable to separation by pro-
cedures using selection. Statistical procedures, however, are available
by which the effects of each of the global units can be isolated and
evaluated. Furthermore, an estimation of the relative importance of the
several complexes of determiners can also be made.


In a study by the authors of a group of college students containing a
large number of students on probationary status, the global factors of
high school preparation, reading ability, and scholastic aptitude were
analyzed as possible determiners of scholarship deficiency. High school 
preparation was measured by translating course grades into grade points 
and averaging them. A special reading test was used involving subject 
matter from several of the common areas of college instruction. A widely 
used test of intellectual capacity provided a measure of scholastic apti- 
tude. The grade-point index served as the criterion of success in college. 

The scores on the three predictor variables were correlated with the
grade-point indices, providing three coefficients of correlation. As was
expected, each coefficient had a rather high positive value, as is shown
in column 2 of Table 1. These generally high relationships are a signifi-
cant fact, indicating that each variable is probably functionally associ-
ated in a positive manner with every other variable. This means that in
a situation where every variable is positively operating, the functional
relationship between any given predictor and the criterion is condi-
tioned by two factors, namely, the relation this predictor has with the
other predictors and its relation, in turn, with the criterion.

Table 1. Determination of Individual Differences in College Scholarship by
Each of the Variables of High School Achievement, Reading
Ability, and Scholastic Aptitude

                            Correlation of variable with college scholarship
Variable                    Other predictors      Other predictors
                            not controlled        controlled

High school preparation     .67                   .55
Reading ability             .51                   .23
Scholastic aptitude         .46                   .13

Before we can determine the exact correlation between any given
predictor and the criterion, we must find a method for holding constant
the effects of the other predictor variables. By applying such a method
to each predictor, in turn, it is possible to determine the functional
relationship holding between this predictor and the criterion when the
effects of other predictor variables are held constant.

Although it is beyond the scope of this book to develop the signifi-
cance of the methods by which this can be accomplished, the student
should be aware of the fact that there are several statistical formulas
for holding constant the effects of several variables while determining the
functional relationship between other variables. In the example of the
college students, the method used is called partial correlation. Like the
simple coefficient of correlation, the coefficient of partial correlation
expresses the degree of relation between two variables. It differs from
the simple coefficient in that it measures the degree of relationship be-
tween two variables when the effects of other relevant variables are held
constant. In our example there were three partial correlation coefficients,
as presented in Table 1. It will be noted that the three predictor variables
correlate with the criterion in order of magnitude from greatest to least
as follows: high school preparation, reading ability, and scholastic apti-
tude. High school preparation seems to have contributed far more to the
college success of these particular students than did their reading ability
or their scholastic aptitude. Obviously, such findings as these have signifi-
cance for the supervision and control of behavior in important college
scholarship situations.
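
For the reader who wishes to see how a coefficient of partial correlation is obtained, the following is a minimal sketch in Python. It uses the standard recursive formula, in which the partial correlation of x and y with z held constant is (r_xy - r_xz * r_yz) / sqrt((1 - r_xz^2)(1 - r_yz^2)). The scores are invented for illustration and are not the data of the study summarized in Table 1; the function names pearson and partial_corr are likewise hypothetical.

```python
import math


def pearson(x, y):
    """Simple (product-moment) coefficient of correlation."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def partial_corr(x, y, controls):
    """Correlation of x and y with every variable in `controls` held constant.

    Each control is removed in turn by the recursive formula
    r_xy.z = (r_xy - r_xz * r_yz) / sqrt((1 - r_xz**2) * (1 - r_yz**2)).
    """
    if not controls:
        return pearson(x, y)
    z, rest = controls[-1], controls[:-1]
    r_xy = partial_corr(x, y, rest)
    r_xz = partial_corr(x, z, rest)
    r_yz = partial_corr(y, z, rest)
    return (r_xy - r_xz * r_yz) / math.sqrt((1 - r_xz ** 2) * (1 - r_yz ** 2))


# Invented deviation scores for four students (illustration only).
high_school = [-3.0, -1.0, 1.0, 3.0]
reading = [-2.0, -2.0, 0.0, 4.0]
scholarship = [-3.5, 1.5, -2.5, 4.5]

simple_r = pearson(reading, scholarship)                        # about .70
partial_r = partial_corr(reading, scholarship, [high_school])   # about .22
```

In these invented scores much of the simple correlation between reading and scholarship arises from the relation each has with high school preparation; when that variable is held constant the coefficient falls from about .70 to about .22, a drop of the same kind as that shown between the two columns of Table 1.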

Two Conditions Essential to the Use of Statistical Controls. Two con-
ditions must be met before we can use statistical procedures of control,
namely, the variables to be controlled must be measured, and the in-
vestigation must be so planned as to make available a measure of un-
systematic variation or error. In statistical control procedures, the numerical
values are manipulated; that is, the numbers representing the quantita-
tive characteristics of the variables are the units manipulated. If the
characteristic being studied cannot be described numerically, then it
cannot be controlled statistically. Designing the study to provide a meas-
ure of unsystematic errors is required in order to evaluate the contribu-
tion that these errors make toward the production of the phenomenon
being examined. An accurate determination of the contribution of the
pertinent variables is impossible when the part played by unsystematic
error variables cannot be estimated.

At this point the student may have difficulty getting an accurate
understanding of error variables. To use the term loosely, an error vari-
able is any variable which operates to introduce inaccuracy in the func-
tioning of the particular variables under study. Unsystematic error,
chance error, sampling error, and experimental error are different terms
used to refer to the effects of some or all of these error variables. Further
reference will be made to these variables in subsequent discussions, but
it is beyond the scope of this book to develop the statistical formulas
and their meanings by which these error variables can be understood.

Two Statistical Control Procedures. Two of the most frequently used
statistical procedures for effecting an analysis of the functional relation-
ships underlying complex behavior are the method of partial correlation,
already mentioned, and the method of the analysis of variance. The
reader will find the concepts involved in these methods treated in ad-
vanced texts in statistical methods. It is sufficient for our purposes to 
realize that there are control procedures available through statistical 
manipulation that open up large areas of psychological problems for- 
merly closed to scientific study. 

Choosing Control Procedures for an Investigation. The control pro- 
cedures we have considered—physical manipulation, selection of experi- 
mental materials, subjects, and data, and statistical analyses—are avail- 
able for controlling any of the multifarious variables with which the 
psychologist must deal. Of course, they will not control every variable 
equally well. The nature of the control procedures to be used on a given
occasion will vary with the nature of the variables to be studied and
the purposes of the investigation. Sometimes the best control procedures
that can be devised will fall short of attaining the desired degree of
control.

Inasmuch as we control variables to achieve definite purposes related
to the objectives of our study, we should avail ourselves of any control
procedure through which we can get answers to the questions being
investigated. We should not control variables just to be controlling
variables, as occasionally seems to be the case even in some current
scientific studies. Neither should we restrict the controls to some one
particular kind because of an “idolatrous” concern for that particular
kind of control procedure. There are experimentalists who do not take
seriously the control procedures offered by statistical methods; and
there are statisticians who seem unaware of experimental techniques
that they could use to advantage. The criterion for selecting a given
control procedure should be the extent to which it will enable us to
solve the problem under investigation.


SYSTEMATIC AND UNSYSTEMATIC DETERMINERS 


Our success in controlling the variables of an experiment depends
upon our knowing the kind of effect the variables have on the end
results. The same variable may play different roles at different times.
To learn the particular nature of the effect of a variable at a given time
and under a particular set of conditions is the difficult task we face as
scientists. According to the general effect that a variable has on a phe-
nomenon under a given set of conditions, it is classified as a systematic
or an unsystematic variable.

Systematic Variables. Variables that have a constant effect upon the
results are called systematic. By “constant effect” is meant that the
final average values obtained from our analyses will be either larger or
smaller than they would have been had the variable been inoperative
during the investigation. Among the systematic factors will be found
the variables pertinent to the purpose of our investigation, that is, those 
variables we have elected to study. These are often called the experi- 
mental variables. 

There will be other factors which, if not controlled, will have constant 
effects upon the complex of variables to be studied. Potentially, they are
systematic variables. These variables may be controlled in order either 
to eliminate entirely any effects that they may have, or to force them to 
contribute equally to all phases of the investigation. In the latter case 
the factors will not have a differential effect upon the end results. These 
potentially systematic variables, which should be prevented from con- 
tributing a constant effect to the end results, we shall call unwanted 
systematic variables. 

Unsystematic Variables. Unsystematic determiners produce variation 
in the complex of variables under study, but they contribute equally in 
both positive and negative directions. They will sometimes increase and 
sometimes decrease the values of the complex of variables, but in the long 
run the final average values will be neither larger nor smaller than they 
would have been had the unsystematic variables been eliminated. 
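
The distinction between the two kinds of variables can be made concrete with a small simulation. The following Python sketch is an illustration only and is not drawn from any study described here: the function name `simulate_scores`, the true value of 100, the constant shift of 5, and the spread of 10 are all hypothetical choices. It assumes that an unsystematic disturbance can be represented as a random quantity centered on zero.

```python
import random
import statistics


def simulate_scores(n, true_value, systematic_shift=0.0, noise_sd=0.0, seed=0):
    """Return n simulated measurements of a quantity whose true value is known.

    systematic_shift: constant effect added to every measurement (hypothetical).
    noise_sd: spread of a zero-centered unsystematic disturbance (hypothetical).
    """
    rng = random.Random(seed)
    return [true_value + systematic_shift + rng.gauss(0.0, noise_sd)
            for _ in range(n)]


baseline = simulate_scores(10_000, true_value=100.0)
shifted = simulate_scores(10_000, true_value=100.0, systematic_shift=5.0)
noisy = simulate_scores(10_000, true_value=100.0, noise_sd=10.0)

# A systematic variable moves the average by its constant effect.
print(statistics.mean(shifted) - statistics.mean(baseline))
# An unsystematic variable leaves the average nearly where it was ...
print(statistics.mean(noisy) - statistics.mean(baseline))
# ... but it widens the spread of the individual measurements.
print(statistics.pstdev(baseline), statistics.pstdev(noisy))
```

Under these assumptions the shifted series differs from the baseline by the full constant effect, whereas the noisy series has nearly the same average but a much larger spread.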

We should not think that the effect of unsystematic factors is ruled 
out just because the chances are high that the final average values will 
not be disturbed. The increased variation introduced into an experiment by 
unsystematic factors increases the difficulty of discovering and accurately 
evaluating the effects of the experimental variables. This problem will 
receive further attention in later sections. 

In any given experiment, there will be unsystematic factors that are 
completely unknown to the investigator. Furthermore, the nature and 
extent of the effects of the unsystematic factors that are known may be 
so poorly understood and thus so inadequately controlled that their 
contribution may outweigh the contribution of the systematic factors 
under study. It is apparent, therefore, that the control of unsystematic 
variables is a significant problem in all psychological research. 


THE CONTROL OF EXPERIMENTAL VARIABLES 


The Problem. The problem of controlling experimental variables is 
one of isolating, manipulating, and measuring one or more variables that 
form the precursor sequences related to one or more other variables that 
are wholly or partially unknown and about which information is desired. 
When the consequent sequences are wholly unknown, the study 
is designed in a way that will allow for detecting and recording the 
changes that would be expected if certain notions or hypotheses of the 
experimenter were true. When the nature of the expected changes is 
partially understood, the study is designed to get a more accurate description 
of the quantitative aspects of these changes as they relate to the 
quantitative aspects of the antecedent sequences. 

The manipulation of experimental variables is planned to achieve one 
of three general levels of control. One purpose is to produce expressions 
of the experimental variable, at such times as they can be observed, 
when the amount of the variable is not determinable. The effects of the 
variable are then compared with findings from situations in which the 
experimental variable is nonfunctional. More precise control over the 
experimental variable is demanded when it is necessary to have several 
levels of expression of the variable, the amounts of 
the expression being unquantifiable or only roughly quantifiable. It is 
known that the two or more levels of expression vary in amount but the 
amount of the differences between levels is not known. A still higher 
degree of control is attained when the various levels of magnitude of the 
experimental variable are measured. It is then possible to describe 
quantitatively the amounts of expression and to measure the amount of 
difference between various expressions. 

All of the control procedures that the scientist has devised are avail- 
able for use in the controlling of experimental variables. The primary 
considerations that determine the kinds of control procedures that should 
be used in any given study are the nature of the variables involved, the 
nature of the functional relations suspected to hold among the variables, 
and the particular questions that the investigator proposes to answer. 

In the glare recovery problem, the experimental variable was the 
amount of vitamin A in the retina. It was varied in reference to the power 
of the eyes to adapt to a reduction in illumination. These two variables 
were known to be functionally related, the rate of adaptation to a 
reduced illumination depending upon the amount of vitamin A in the 
retina. The object of the experiment was to determine if this functional 
relationship could be affected by the manipulation of a person’s dietary 
... more or less direct manipulation of the experimental ... 
description of the amount of vitamin A being assimilated under the 
several dietary conditions. Neither of these more precise measures of 
vitamin A was considered necessary to achieve the information needed 
for answering the question: Will changes in the assimilated amount of 
vitamin A resulting from changes in the daily diet affect the recovery 
time from glare stimulation? 

The control procedure adopted for quantitatively varying vitamin A 
illustrates one of the cardinal principles of experimental design, namely, 
that the precision of control effected for any variable should be gauged 
in terms of the purposes of the experiment. Precision over and above 
what is required to accomplish the experimental objectives is usually 
unproductive, although this added precision seldom affects the findings 
detrimentally. Occasionally, when more precise procedures than are 
actually required are used, additional factors or characteristics are dis- 
closed which less precise procedures would not reveal. 

In the glare recovery experiment, the consequent variable of the re- 
covery of the retina following glare stimulation was evaluated by meas- 
uring the time of recovery to the point when the target object was 
identified. The use of a stop watch was adequate. Of course, the experi- 
menters varied in their reaction times in manipulating the watch, but 
these variations were extremely small in comparison with the very long 
recovery times that were being measured. More precise measures could 
have been obtained by an electric timing device which would have been 
operated by the same switch that turned off the stimulus light. This 
greater precision appeared to promise no information beyond that which 
could be obtained by the use of the stop watch. 

A more critical problem was the determination of that instant when 
the subject recognized the target object. The particular level of distinct- 
ness in perception of the target—that is, whether the outline of the arrow 
was vague or clear—at which instant the stop watch was to be turned 
off, was not important. The important problem was to determine the 
recovery time in different subjects or in the same subject at different 
testings when recovery had reached approximately the same point or 
level in so far as the capacity for recognizing objects is concerned. It 
will be remembered that the target object was made in the shape of an 
arrow, and the subject was required to tell the direction the arrow was 
pointing. In this manner it was thought that the measurement would be 
taken at approximately the same point in the recovery process of the 
eyes. Recognition of the target object itself was a variable, because 
subjects differed in their judgments of that instant when the target 
changed from unidentified to identified. There was considerable variation 
in willingness of different subjects to report the instant the target object 
was identified. Some subjects reported as soon as the outline of the arrow 
was recognized, whereas others waited until they could identify the 
pointed end of the arrow. In the case of the former, the direction of the 
arrow was guessed rather than observed. This variation, of course, was 
not one of the experimental variables, but it is obvious that if 
uncontrolled it would directly affect the experimental variable of time of 
recovery. By rotating the arrow and requiring the subject to identify 
correctly its direction in three successive positions, the effect of this 
potentially disturbing variable was minimized and probably completely 
eliminated. 

A problem in which very precise control through physical manipulation 
would be effected is the determination of intensity thresholds in 
hearing. The problem might be limited to pure tones and involve the 
intensity thresholds of pure tones of different pitch. Control of these 
variables involves the use of such electrical devices as resistances, 
capacitors, inductances, etc. Through these devices we are able to 
produce pure tones of selected frequencies which can be varied 
systematically in respect to their intensity. 

Control through Procedures of Selection. Examples follow in 
which experimental variables are controlled by selecting materials, 
selecting subjects, and selecting data. In later discussions on control of 
unwanted systematic factors and unsystematic factors, further details on 
selection methods of control will be presented. 

Control through Selection of Material. Let us consider the question of 
whether or not the white rat can discriminate patterned stimuli. This 
problem could have arisen from arguments that the rat does not have 
pattern vision (ability to distinguish patterned stimuli) but only 
brightness vision (ability to discriminate differences between light and dark). 
To study the question, we would have to select stimulus cards that varied 
... presenting equal relative ... 

... studying the problem would be to obtain college and noncollege 
populations of equal intellectual caliber and train them to fly. Another way 
would be to use only college students, but separate them into groups 
that differ in general intellectual ability but which have been exposed 
to the same content in college courses. Although directed to the same 
problem, these two procedures would not be getting at the same ques- 
tion. In either procedure, the only method for setting up variation in the 
experimental variable would be that of selecting the subjects. The task 
would not be an easy one, because a rather complete knowledge of the 
intellectual capacity and specific educational achievements of potential 
subjects would be required as a basis for correctly selecting and assign- 
ing them. 

Control through Selection of Data. We can illustrate this control pro- 
cedure by a study of the relationship between brightness of illumination 
of highways and frequency of occurrence of night accidents. One sug- 
gestion that has been made for decreasing the frequency of night acci- 
dents is to improve highway illumination, particularly at hazardous 
points along the roadway. Here we can appeal to Dame Nature, who 
during most months of the year provides us with some nights on which 
the highways are dimly illuminated and other nights on which the high- 
ways are moderately well illuminated, such as when the moon is in full 
phase. The motor-vehicle department of most states keeps a permanent 
record of night accidents. By selecting nights that were known to be 
very dark and nights when there was considerable illumination from 
the moon and tabulating accident data for these nights, we could set up 
a test of the experimental problem of the relationship between illumina- 
tion of highways and frequency of night accidents. 
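
The tabulation just described can be sketched in a few lines of Python. This is an illustration under stated assumptions and not a record of any actual analysis: the function name, the layout of the records as pairs of moon fraction and accident count, and the two cut-off values are all hypothetical, and the figures in the usage example are invented. The sketch also ignores cloud cover and traffic volume, which a real study would have to consider.

```python
from statistics import fmean


def mean_accidents_by_illumination(nightly_records, dark_max=0.1, bright_min=0.9):
    """Keep only very dark and brightly moonlit nights, then average each set.

    nightly_records: list of (moon_fraction, accident_count) pairs, where
    moon_fraction runs from 0.0 (new moon) to 1.0 (full moon). The record
    layout and both cut-off values are assumptions made for this sketch.
    """
    records = list(nightly_records)
    dark = [count for moon, count in records if moon <= dark_max]
    bright = [count for moon, count in records if moon >= bright_min]
    # fmean raises StatisticsError if either selection turns out to be empty.
    return {
        "dark": fmean(dark),
        "bright": fmean(bright),
        "nights_used": len(dark) + len(bright),
    }


# Invented figures, for illustration only.
example_records = [(0.0, 4), (0.05, 6), (0.5, 3), (1.0, 2), (0.95, 4)]
print(mean_accidents_by_illumination(example_records))
```

Nights of intermediate illumination are simply left out of the tabulation; it is this act of selection that sets up the variation in the experimental variable.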

Control through Statistical Procedures. As previously noted, in many 
situations in psychology it is either impossible or undesirable to 
manipulate the variables physically or to accomplish control through selection. 
If the experimental variables can be left to function under 
natural conditions and can at the same time be measured, much 
valuable information can be gained that is not available under 
conditions in which the variables are rigorously isolated. Under methods of 
physical manipulation, the interrelations of several variables within a 
complex situation cannot be adequately varied under controlled condi- 
tions, and this means that the nature of interactive effects between vari- 
ables cannot be studied. Such interaction often yields to analysis by 
statistical procedures. 

Suppose that the problem set for investigation is the hypothesis that 
the superiority of the distributed method of learning is independent of 
the factors of age, sex, and the meaningfulness of the subject matter. 
Here there are three factors the interactive relationships among which ... 


THE CONTROL OF UNWANTED SYSTEMATIC VARIABLES 


The Problem. We are here concerned with the systematic effects that 
might be contributed by factors that are not part of the complex of 
variables specifically under investigation. As previously pointed out, 
many of these disturbing systematic factors that are known cannot be 
controlled through direct means. It should be added that in 
investigations involving the study of global responses, many of these factors 
remain wholly unknown. When these unwanted variables are known, it is 
sometimes possible to eliminate them entirely by using physical manipu- 
lation, procedures of selection, or procedures of statistical control. But 
if we cannot completely remove these factors and if we are unable to 
measure their separate effects at the time we measure the effects of our 
experimental variables, the next best method for us to follow is to hold 
their contribution constant throughout all phases of the study. Under 
this controlled variation the factors will not function in a manner that 
will produce systematic changes in the final average values. 

Control through Physical Manipulation. This type of control can be 
illustrated by reference to the glare recovery study. A potentially 
systematic factor affecting the recovery time was stray light entering the 
testing box through the view aperture. If uncontrolled, this stray light 
would directly affect the level of illumination in the testing box and 
thus the relative brilliance of the stimulus light during test runs. En- 
trance of stray light into the box should then be prevented. This was 
accomplished by compelling the subject to hold his head tightly against 
the visor of the aperture. 

In an experiment on the intensity threshold of pure tones, extraneous 
noises would have the systematic effect of raising the threshold values. 
By mechanically soundproofing the test room this factor would be brought 
under control. 

Control through Selection of Materials. In the early studies on memory 
it was found that the subject’s acquaintance with the experimental 
materials often entered as a disturbing variable. The material to be 
memorized was not equally familiar to all subjects; therefore the learning of it 
was more difficult for some individuals than for others. This factor of 
difficulty then affected the rate of learning in experiments in which other 
variables were the object of study. Nonsense syllables were devised to 
control the factor by making the material equally unfamiliar to all sub- 
jects. Nonsense syllables are combinations of vowels and consonants that 
do not make sense. They provide material for memory experiments 
in which individual differences in familiarity with the material can be 
made to approach zero, as careful selection of the syllables makes them 
equally unfamiliar to most experimental subjects. 

Control through Random Selection and Assignment of Subjects. Logic 
of the Procedure. Control over unknown factors and over known but un- 
wanted relevant factors for which there are no measures can be ob- 
tained by planning the study so that these factors will be given equal 
opportunity to vary in all phases or conditions of the investigation. 
Obtaining equal distribution of the effects of these factors is accomplished 
at the time the individual subjects are selected for the investigation 
and is achieved by using the method of random selection of subjects. 
In the use of this control procedure the assumption is made that the 
factors to be controlled are randomly distributed in variable amounts 
and in variable combinations in the different experimental subjects. 
Selecting the subjects and assigning them to the several conditions of 
the experiment by some method that distributes them randomly in these 
conditions will thereby automatically distribute the factors at random 
within these conditions. Consequently, the unwanted factors will tend 
to contribute in the same way and to the same degree in each of the 
phases of the investigation. Such unsystematic variation will not then 
differentially affect the final average values. 
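
A minimal sketch of such a chance method of assignment is given below in Python. It is one possible procedure among several and is offered only as an illustration: the function name `randomly_assign` and the condition labels are hypothetical, and the sketch assumes that shuffling the list of subjects and dealing them out in turn is an acceptable randomizing device.

```python
import random


def randomly_assign(subjects, conditions, seed=None):
    """Shuffle the subjects and deal them out to the conditions in turn."""
    rng = random.Random(seed)
    shuffled = list(subjects)
    conditions = list(conditions)
    rng.shuffle(shuffled)
    groups = {condition: [] for condition in conditions}
    for index, subject in enumerate(shuffled):
        # Dealing in rotation keeps the group sizes within one of each other.
        groups[conditions[index % len(conditions)]].append(subject)
    return groups


# Forty hypothetical subjects, identified only by number.
groups = randomly_assign(range(1, 41), ["method A", "method B"], seed=1)
print(len(groups["method A"]), len(groups["method B"]))
```

Because no characteristic of the subject enters into the assignment, any factor carried by the subjects, known or unknown, has the same opportunity of falling into either condition.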

An Example. Let us consider the common problem of determining a 
difference between two variables. Suppose the problem is to investigate 
two methods of performing a motor task such as assembling a pump. 
Two classes of individuals are available as subjects; therefore we teach 
method A to class I, and method B to class II. The results show that 
class I using method A assembled 50 per cent more pumps than class II. 
From these findings it cannot be concluded that method A is better 
than or even as good as method B until it can be shown that other factors 
not part of the two methods were operating unsystematically or in a 
nondeterminate manner between the two groups of subjects. 

One such factor would be the mechanical ability of the subjects. It is 
very probable that regardless of methods of assembling, individuals with 
high mechanical ability would assemble more pumps than individuals 
with mediocre mechanical ability. In the example, the mechanical ability 
of the subjects was an unwanted systematic variable, but in the proce- 
dure used for selecting the subjects no attempt was made to determine 
the relative amount of mechanical ability represented in the two classes. 
It is obvious that it is neither desirable nor necessary, nor even possible, 
to eliminate mechanical ability from the experiment, but it is important 
that this ability be forced to vary equally within the conditions of the 
two methods so it will not operate to determine the relative assembling 
achievements of the two groups. 

One way of equalizing the effect of mechanical ability is to measure 
all of the subjects in this particular ability, arrange them in pairs of 
equal ability, and then randomly assign the individuals of each pair so 
that the two groups will have approximately the same average mechani- 
cal ability. This selection, however, might not equate some other un- 
wanted factor that would have a systematic effect upon the results. 
The best method to use when several unwanted systematic factors are 
present and cannot be measured is to assign the subjects to the two 
methods according to some random or chance method, but without reference 
to any particular ability. In this way all of the unwanted factors will have 
an equal opportunity of getting into the two groups to the same degree, 
and so their general effect upon the two methods will be equalized. Any 
systematic difference in performance of the two groups cannot then be 
ascribed to these factors and will have to be attributed to the variables 
on which a difference between the groups was purposely arranged. 
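
The first procedure mentioned above, in which subjects are measured, paired, and then divided by chance, can be sketched as follows. The Python code is an illustration under assumptions, not a prescribed routine: the function name `matched_pair_assignment` and the ability scores are hypothetical, and pairing is done simply by taking neighbors in the rank order of the measured ability.

```python
import random
import statistics


def matched_pair_assignment(ability_by_subject, seed=None):
    """Pair subjects of adjacent ability, then split each pair at random.

    ability_by_subject: dict mapping a subject label to a measured score
    (hypothetical mechanical-ability scores in the example below).
    """
    rng = random.Random(seed)
    ranked = sorted(ability_by_subject, key=ability_by_subject.get)
    group_a, group_b = [], []
    # Walk down the ranking two at a time; a leftover odd subject is dropped.
    for i in range(0, len(ranked) - 1, 2):
        pair = [ranked[i], ranked[i + 1]]
        rng.shuffle(pair)
        group_a.append(pair[0])
        group_b.append(pair[1])
    return group_a, group_b


# Invented scores for eight subjects, for illustration only.
scores = {"s1": 42, "s2": 55, "s3": 47, "s4": 61, "s5": 58, "s6": 44, "s7": 50, "s8": 63}
group_a, group_b = matched_pair_assignment(scores, seed=7)
print(statistics.mean(scores[s] for s in group_a),
      statistics.mean(scores[s] for s in group_b))
```

The two group averages on the measured ability come out close together, but, as noted above, nothing in this procedure equates the groups on factors that were not measured.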

Measuring Unwanted Variables as Unsystematic Variables. When by 
a random selection and assignment of the subjects we force unwanted 
systematic variables to operate equally in all conditions of an investi- 
gation, we actually make these variables contribute unsystematically to 
the final average values. Their contribution then cannot be separated 
from the effects of “bona fide” unsystematic factors, and is evaluated 
through the use of special statistical procedures devised for measuring 
unsystematic variation. What is accomplished then is not a complete 
elimination of these disturbing factors, but a forcing of them into a form 
in which their contribution can be evaluated statistically. 

Control through Selection of Data. Suppose we set up a research 
project to study the relationship between restriction of side vision and 
accident frequency, utilizing data available in the files of the motor- 
vehicle department of some state government. It is obvious that we 
must be able to study accident drivers with restricted fields of vision. 
We would also need to select a group of accident drivers with normal 
side vision. By examining driver accident record cards on which side- 
vision test scores are recorded we could prepare a roster of accident cases 
with and without restricted side vision. In examining the accident rec- 
ords of these cases we would find that some of the accidents would be 
referable to excessive speed, slow reaction time, or some other similar 
driver characteristic. Such factors might operate differentially between 
the two groups, and thus as systematic variables they would contami- 
nate the end results. To control these unwanted systematic factors we 
could select accident situations in which side vision was favored as a 
possible determinant factor. Two such situations would be intersec- 
tional accidents and cutting-in accidents, in both of which side vision 
is an important driver qualification for safe driving. Selecting accidents 
for study that occur in such restricted situations would tend to empha- 
size side-vision factors and to minimize the effects of unwanted sys- 
tematic determiners. 

Control through Statistical Analysis. When means are available for 
measuring an unwanted systematic variable, it is sometimes advantageous 
to control it through statistical analysis. This requires that the factor be 
allowed to vary systematically and that measurements be made of these 
variations. Suppose that in the investigation on the factors conditioning 
college success our purpose was to study the contribution of scholastic 
aptitude to college success. We would then classify high school prepa- 
ration and reading ability as unwanted systematic factors, and we would 
be required to eliminate their contribution from the end results. As 
explained earlier, in such a problem the procedures of partial correlation 
enable us to study the effect of a given variable when the effects of other 
variables are held constant. We would then learn the contribution of 
scholastic aptitude when the effects of high school preparation and 
reading ability were held constant. 
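
For readers who wish to see the computation, the following Python sketch applies the standard first-order partial correlation formula, r_xy.z = (r_xy - r_xz r_yz) / sqrt((1 - r_xz^2)(1 - r_yz^2)), and repeats it once for each additional variable to be held constant. The function names `pearson_r` and `partial_r` are hypothetical, and the short series of numbers are constructed for illustration; they are not data from the college success study.

```python
import math


def pearson_r(x, y):
    """Ordinary product-moment correlation between two equal-length series."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def partial_r(x, y, controls):
    """Correlation of x and y with every series in controls held constant."""
    if not controls:
        return pearson_r(x, y)
    *rest, z = controls
    # Remove the last control variable from correlations that already
    # have the remaining control variables partialled out.
    r_xy = partial_r(x, y, rest)
    r_xz = partial_r(x, z, rest)
    r_yz = partial_r(y, z, rest)
    return (r_xy - r_xz * r_yz) / math.sqrt((1 - r_xz ** 2) * (1 - r_yz ** 2))


# Constructed series: x and y are related only through the third series z.
z = [1, 1, -1, -1]
x = [2, 0, 0, -2]
y = [2, 0, -2, 0]
print(pearson_r(x, y))       # raw correlation between x and y
print(partial_r(x, y, [z]))  # correlation with z held constant
```

In this constructed case the raw correlation is 0.5, while the partial correlation is zero apart from rounding error, since the whole of the relationship between x and y is carried by z. In the college success problem the call would take the form `partial_r(aptitude, success, [preparation, reading])`, where the four names stand for hypothetical lists of scores.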


THE CONTROL OF UNSYSTEMATIC VARIABLES 


The Problem. It will be recalled that an unsystematic factor is one 
that influences a given phenomenon both positively and negatively in 
about equal amounts, and so in the long run it presumably does not 
change the final average values. Unsystematic factors, however, do 
increase the variability of the expressions of the phenomenon. Suppose 
we administer a research aptitude test to a group of 1,000 college 
sophomores. We test them in small groups during one semester, giving the 
test at different hours of the day on different days of the week, in dif- 
ferent experimental rooms, and using several test administrators. Under 
these conditions test performance will be affected by variations in hourly 
efficiency, by variations in efficiency from day to day and from week to 
week, by variations in the room heat, ventilation, and light, and by any 
variations in the testing procedures of the different administrators. Some 
students will do better than their “true” average, others will do poorer. 
In the long run with a group of this size the number doing better is as- 
sumed to offset the number doing poorer, and so it is assumed that the 
mean performance of the group will not be changed. The variability of 
the group, however, will be increased above what it would be with the 
unsystematic factors eliminated. With variation in these factors elimi- 
nated, every individual would perform nearer to his average ability, and 
therefore the total variability of the group would be smaller. 

The variation in an individual’s performance that occurs from unsys- 
tematic factors increases the difficulty of finding in the systematic variable 
the most representative or “true” measure of performance of the indi- 
vidual. It is then said that a less accurate or reliable measure of the 
“true” performance is obtained when unsystematic variation is present, 
and that the accuracy decreases as the amount of unsystematic variation 
increases. This same logic is applicable to the measurement of the 
average performance of a group of individuals. The greater the amount of 
unsystematic variation the less reliable the determination of the value 
of the “true” average performance of the group. It is obvious then that 
the effects of unsystematic factors are in need of control and should be 
eliminated whenever possible. 
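
The loss of reliability in a group average can be illustrated by repeating an imaginary testing many times and noting how much the obtained average wanders from one repetition to the next. The Python sketch below does this under assumptions chosen only for illustration: the function name `spread_of_group_means`, the group of 100 subjects, the “true” scores centered on 50, and the two amounts of unsystematic variation are all hypothetical.

```python
import random
import statistics


def spread_of_group_means(n_subjects, noise_sd, n_replications=500, seed=0):
    """Standard deviation of the group mean over repeated imaginary testings."""
    rng = random.Random(seed)
    # Fixed hypothetical "true" scores for the same group of subjects.
    true_scores = [rng.gauss(50.0, 8.0) for _ in range(n_subjects)]
    means = []
    for _ in range(n_replications):
        # Each testing adds a fresh zero-centered disturbance to every subject.
        observed = [t + rng.gauss(0.0, noise_sd) for t in true_scores]
        means.append(statistics.fmean(observed))
    return statistics.pstdev(means)


quiet = spread_of_group_means(100, noise_sd=2.0)
noisy = spread_of_group_means(100, noise_sd=10.0)
print(quiet, noisy)
```

With the larger amount of unsystematic variation the obtained average shifts about much more from testing to testing, which is what is meant by saying that it is a less reliable determination of the “true” average.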

So many of the factors producing unsystematic variation are un- 
known that it is difficult to know just what kind or degree of control is 
effected by any given procedure. The general objective is to reduce as 
much as possible the variation in the subject’s responses that seems to be 
determined by factors not part of the experimental variables under study. 

Control through Physical Manipulation. If the investigator has knowl- 
edge concerning a possible unsystematic factor, he may be able to elim- 
inate it by direct manipulation of the physical characteristics of the ex- 
perimental conditions. Suppose we become embroiled in an argument 
about the difference in speed of running of football players. We main- 
tain that the backfield players are about 10 per cent faster than linemen, 
while our opponents argue that the difference is about 30 per cent in favor 
of the backs. We want to set up an experiment and collect empirical 
data for settling the dispute. We decide that a dash-run over a short dis- 
tance would be a fair test, and then go about to measure the speed of 
running of a large number of individuals who classify themselves as 
backs and a large number who classify themselves as linemen. 

The experiment is conducted at different schools, the running being 
over distances varying from 50 to 75 yards, on variable running surfaces 
(turf, gravel, asphalt), and the subjects wearing a variety of track, 
football, and everyday shoes and clothes. Obviously, under such variable 
conditions, many unsystematic factors would be functioning, which might 
not affect the final mean running times, but which certainly would 
greatly increase the variability of these times. To eliminate these factors 
a standard set of conditions could be established. These would include 
a course of 50 yards, the running to be done on turf, the subjects to wear 
regulation football clothes and shoes, the running to start from a standard- 
ized crouching position, etc. 

Control through Selection of Materials. By careful selection of mate- 
rials, the experimental situation can be made more homogeneous through 
controlling factors that might produce unsystematic variation. This can 
be illustrated in experiments on the attention value of advertisements. 

Suppose the problem is to investigate the effect on attention value 
of variation in the size of advertisements. The attention value of adver- 
tisements can be measured by the degree to which they are recognized 
or recalled some time subsequent to the original exposure. A procedure 
sometimes followed is to select the materials from popular magazines. 
We select advertisements that vary in the following page sizes: 1/8, 1/4, 1/2, 
3/4, and full page. It is obvious that when we select advertisements 
according to size there will also be variation in such factors as number of 
words, nature of type, over-all composition, nature of message, color, 
illustrations, and so on. Some of these variables can be removed entirely. 
For instance, color can be removed by using only black-white 
advertisements. The nature of the message would be very difficult to control. 
Meanings aroused by the message and later remembered provide a 
measure of the attention value. These meanings would change with varia- 
tion in the message. We would not want to use the same message in all 
of the advertisements, however, as a given subject would be used to 
react to advertisements of different sizes. To control the effect of message 
on attention value we would introduce the same kinds of variations in 
message in each of the different sizes of advertisements. We would also 
endeavor to keep variation in the message as small as possible consistent 
with getting an accurate measure of the attention value. 


Control through Selection of Subjects. Here the objective is to obtain a 
homogeneous group of individuals, the factors on which the homogeneity 
is based being those factors that would operate to produce unsystematic 
errors. 

In our experiment on the speed of running of backs and linemen 
we could use subjects widely varying in 
experience. We could include players from junior high schools, high 
schools, preparatory schools, junior colleges, colleges, universities, ama- 
teur teams sponsored by business establishments, and professional teams. 
We could include those who had just learned the game, those at the 
height of their success as players, those who are almost through with 
active participation, and those who had long since hung up their 
“moleskins.” We could include those who are “eager beavers” and those who 
take the game “in stride”; those who are in condition and those who are 
not, etc. Including representatives from such widely varying groups 
would introduce many unsystematic factors. Again, these factors might 
not affect the final average running times determined for backfield and 
linemen, but they would greatly increase the variability of these times. 
By accurately selecting the subjects we could make the groups 
homogeneous in reference to many unsystematic factors without making 
them unrepresentative of backs and of linemen in respect to speed of 
running. In our experiment this might mean that we would select only 
individuals having certain definite characteristics, such as being college 
players who like the game, who have had at least two years of experience 
in a major college, who are currently playing and are in good physical 
condition, etc. 

Control through Selection of Data. The controlling of unsystematic 
factors by this procedure can be illustrated in the study of the 
relationship between restriction of side vision and frequency of accidents. 
Having selected the two groups of accident drivers, drivers with 
restricted side vision and drivers with normal side vision, our next task is 
to determine the frequency of accidents of the two groups. In examining 
the accident records of these drivers we would find many accidents re- 
corded that are not attributable to these drivers, that is, accidents brought 
about by other drivers, or by car, road or weather conditions over which 
our selected drivers had no control. In selecting the accidents to be used 
in the analysis we want to make sure that accidents referable to these 
other factors are not included, because of the possibility of introducing 
unsystematic variation. We would select for study only accidents for 
which our drivers were responsible. In doing so we would reduce the 
heterogeneity of the accident determiners and thus minimize the opera- 
tion of unsystematic variables. 

Control through Statistical Procedures. When examined closely it is 
found that the frequency of occurrence of the effects of unsystematic 
factors follows a well-established law. We have learned that, on the 
average, unsystematic factors increase and decrease the values of the 
experimental variable in about equal amounts. Furthermore, it is known
that the effects of unsystematic factors on the experimental variable vary 
in amount from small to large. Small changes from these factors occur 
more frequently than large changes, and as the amount of the unsystem- 
atic effect gets larger the frequency of its occurrence gets smaller, 
following a definite form of decrease in frequency. The distribution of 
these unsystematic error effects above and below the “true” value of the 
experimental variables is often called the “law of error” or the “curve of 
error.” It has definite properties that are computable through the use of 
appropriate mathematical formulas. We can then estimate the possible 
quantitative contribution of unsystematic factors to the experimental 
results. 
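
The pattern just described can be sketched in a few lines of Python. The sketch rests on an assumption of ours, not of the text: each unsystematic factor is treated as an independent push of one unit up or down, and twenty such pushes are summed for every observation. The function name and the numbers of factors and observations are hypothetical.

```python
import random
from collections import Counter

def simulate_errors(n_observations=2000, n_factors=20, seed=0):
    """Sum n_factors small, independent +1/-1 pushes for each observation."""
    rng = random.Random(seed)  # fixed seed so the run is repeatable
    return [
        sum(rng.choice((-1, 1)) for _ in range(n_factors))
        for _ in range(n_observations)
    ]

errors = simulate_errors()
mean_error = sum(errors) / len(errors)
small = sum(1 for e in errors if abs(e) <= 2)    # small net effects
large = sum(1 for e in errors if abs(e) >= 10)   # large net effects

print("mean error:", round(mean_error, 2))
print("small effects:", small, "large effects:", large)

# Crude text histogram: one star for every ten observations.
for value, count in sorted(Counter(errors).items()):
    print(f"{value:+3d} {'*' * (count // 10)}")
```

Under these assumptions the positive and negative effects should nearly cancel in the average, and small net effects should turn up far more often than large ones, which is the general shape the "curve of error" describes.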

An example will further clarify the distribution of these unsystematic 
errors. Suppose we desire to learn the reading rate of a particular indi- 
vidual for a particular type of reading material. We want this rate to be 
independent of the time of day, the cooperative efforts of the subject, his 
interest in the outcome, the lighting and ventilation of the room, and 
other similar factors which we assume will operate to produce unsystem- 
atic errors. On any given day, each of these factors will have a particular 
influence on the reading rate, but the end effect will differ from day to 
day; one time being positive, another time being negative; occasionally 
being large but more often being small. The reading rate will then 
fluctuate from day to day, and if other factors are constant, these
fluctuations will be due to the unsystematic factors.

The results of such a study are represented in Fig. 1. The reading rate 
varies from 235 to 265 words per minute. The rates of medium size occur 
more frequently, the rates of extreme size less frequently, there being a
definite curvilinear relationship between the size of the rate and its
frequency. 

We can evaluate the magnitude of variation represented in the
reading-rate distribution curve and can estimate the contribution to this
variation of unsystematic factors. Statisticians have provided precise
procedures for this purpose.


Fig. 1. Distribution curve representing variation in rate scores of one individual on
a reading test. (Horizontal axis: words read per minute, 230 to 270; vertical axis:
frequency of occurrence.)


The average reading rate is 250 words per minute. We can feel
confident that this value accurately represents the individual's ability
to read under the conditions that we presented him. This conclusion is
based on several facts, one of which is that the positive variations in rate
offset the negative variations and therefore the average value remains
representative of the individual's “true” reading ability for the given set
of conditions.
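
The offsetting of positive and negative variations can be checked with a small worked sketch. The frequencies below are invented for illustration and are not the data behind Fig. 1; they are merely chosen to be symmetrical about 250 words per minute. The function names are likewise hypothetical.

```python
# Hypothetical record: reading rate (words per minute) -> number of days observed.
rate_frequencies = {235: 1, 240: 3, 245: 6, 250: 8, 255: 6, 260: 3, 265: 1}

def mean_rate(freqs):
    """Average rate, weighting each rate by how often it occurred."""
    total_days = sum(freqs.values())
    total_words = sum(rate * count for rate, count in freqs.items())
    return total_words / total_days

def net_deviation(freqs, centre):
    """Sum of all deviations from centre; zero when ups and downs offset."""
    return sum((rate - centre) * count for rate, count in freqs.items())

average = mean_rate(rate_frequencies)
print(average)                                   # 250.0
print(net_deviation(rate_frequencies, average))  # 0.0
```

With a symmetrical set of frequencies the deviations above the average exactly cancel those below it, so the average is left undisturbed by them.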

A description of the statistical procedures for evaluating the
contribution of unsystematic errors is beyond the scope of this book. It is
important, however, that we know that accurate means are available for
estimating the effects of these errors. Statistical procedures provide
a very important control over unsystematic variables.

The scientist usually is forced to study samples of his phenomenon
because of the difficulty, and in some instances the impossibility, of
studying all of its expressions. Errors may
arise because the sample studied is not representative of the population.
These are known as “sampling errors” and are classified as one form of 
unsystematic error. They are evaluated by the statistical procedures
referred to above.


CHAPTER 6 


Some Facts and Principles of Psychological 
Measurements 


The most rigorous kind of description involves the assignment of number 
meanings to the variables under study. When accurately assigned, number
meanings make possible greater precision in description and provide 
the basis for exact quantitative comparisons. 

With the growing application of scientific methodology in the study 
of behavior, there has been a corresponding increased demand for quan- 
titative description and for mathematical treatment of psychological data. 
But adoption in unmodified form of the analytical procedures of
mathematics and statistics has seldom been completely successful.
Psychological variables have unique properties which make a blind use of
the procedures dangerous. The application of statistical methods to
psychological data has had a short history, but when we keep in mind the
complexity of psychological determinants, we are justified in concluding
that remarkable progress has been made toward the goal of describing
behavior quantitatively.
THE ORDERING OF VARIABLES BY MEANS OF MEASUREMENT


Past discussions have emphasized the importance of finding order in
any data that have been collected by means of the scientific method.
Through the process of ordering, many meanings are discovered or
evolved that are not apparent in the first examination of the data. In the
procedures of quantitative description we have available additional
devices for ordering data and for discovering many significant meanings.
It is important that we consider the ways in which psychological variables
can be ordered in terms of the principles of measurement.

The Use of Frequency of Occurrence in Ordering Variables. In the
procedure of counting the frequency of occurrence of a phenomenon,
we have a ready means of introducing quantitative description into our
analysis. The importance of a phenomenon very often is reflected in the
number of times it takes place, and its relations with other phenomena
are bound up with its frequency of occurrence. It is possible, by the 
simple mathematical process of counting, to discover important quanti- 
tative meanings of the phenomenon. 

The Number within a Class. In an earlier chapter, description in the 
form of classification was discussed. To classify, we must be able to 
recognize in each event being studied the presence or absence of the 
particular characteristic on which the classification is made. This enables 
us to place each event either within or without the class. By applying 
the counting process to the number that is within the class, we are able 
to increase the precision of our description. The total number within the 
class is a new meaning that we can apply to the class as a whole. It is a 
quantitative description of the size of the class. 

Suppose we are interested in studying the attendance of children at 
the motion picture Quo Vadis. We can separate all individuals attending 
the show into those below the age of twelve and those above this age. 
This is readily done by sorting the tickets by color, the tickets for chil- 
dren being a different color from those for adults. We can now count up 
the number of children’s tickets and thus obtain the number of children 
attending the show. 

Additional Meanings through Comparisons. Sometimes the number 
within a class does not tell us all we need to know. We oftentimes wish 
to know whether this number is large or small, and so we need to 
evaluate its size in terms of some other number related to it. This other 
number may be either the number of individuals classified as not falling 
in the class, or the total number of individuals being classified. We can 
get a more accurate understanding of the size of the class by comparing 
the number in the class with either of these other numbers. 

In our example, the meaning or importance we can assign the number 
within the class is strikingly different for the situation of 50 children 
and 450 adults than for the situation of 50 children and 50 adults. In 
other words, the number of individuals not in the class gives us additional 
quantitative information about the class itself. We now have a quantita- 
tive meaning based on relative frequency, that is, the frequency within 
the class compared with the frequency not in the class. 

Comparisons Using Per Cent Frequency. By stating the numbers in the 
classes as percentages, it is sometimes easier for us to make a judgment 
about relative frequency. The total number of individuals considered is 
given the value of 100, and the number in the class is expressed as a
proportion of this value. In our example, the per cent of children attend- 
ing the show in the first situation is 10 and the per cent of adults attend- 
ing is 90. 
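
The arithmetic of per cent frequency can be written out as a short sketch. The counts are the two situations from the example (50 children with 450 adults, and 50 children with 50 adults); the function and variable names are our own.

```python
def per_cent_frequency(counts):
    """Express each class count as a per cent of the total number classified."""
    total = sum(counts.values())
    return {label: 100 * number / total for label, number in counts.items()}

first_situation = {"children": 50, "adults": 450}
second_situation = {"children": 50, "adults": 50}

print(per_cent_frequency(first_situation))   # {'children': 10.0, 'adults': 90.0}
print(per_cent_frequency(second_situation))  # {'children': 50.0, 'adults': 50.0}
```

The number of children is 50 in both situations, yet the per cent figures (10 against 50) show at once how differently that number should be judged.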

One of the reasons for using percentages is that we have a common base
in terms of which the numbers of several situations can be expressed.
Suppose we conduct our study of attendance at Quo Vadis on three
consecutive days and obtain the count shown in the table below.

        Children     Adults       Per cent children
Day     attending    attending    attending
1       301          34           [illegible]
2       230          20           80
3       290          25           80

A comparison of the frequencies in column 2 with those in column 3 tells us
that many more children than adults attended the show on each of the three
days. For this conclusion we do not need to use percentages. The
comparison of the attendance on the three days, however, is more difficult to
make when we use the original numbers. In column 4 we have expressed
the numbers of the second column in terms of per cent. It is seen that the
per cent of children attending was the same on all three days. Per cent
scores make it possible, then, to express in terms of the same base the
relative frequencies of different situations. When the original numbers in
the different situations vary in magnitude, as in our example, the per cent
relative frequency greatly facilitates the comparisons we desire to make.

Examples of Ordering in Terms of Frequency. Ordering by means of
frequency of occurrence is a common procedure in science. The physicist
has the Geiger counter by which he measures the amount of radiation in
studies of atomic energy. The physiologist counts the number of blood
cells seen on a calibrated slide under a microscope and from his count
makes quantitative statements about the content of an individual's blood.
The psychologist uses this procedure when he evaluates individual
performance in terms of the units of work achieved, such as the number of
trials to learn, the number of pages of text read in an hour, the
number of items correctly answered, the number of items forgotten after a
given time interval, etc. When behavior is evaluated in terms of accuracy,
ordering is then done in terms of the number of correct responses or the
number of error responses. Following are some examples of measures of
error performance: the number of misspelled words in a spelling test, the
number of mistakes in typing ten pages, the number of entrances into
blind alleys of a maze, the number of false leads attempted in the
solution of a reasoning problem, etc.

It should be emphasized that in this very simple mathematical procedure
of counting we possess the means for discovering very important
quantitative meanings in our data.

The Use of Relative Amounts in Ordering Variables. To order by 
frequency of occurrence, we first needed to be able to recognize the 
presence and absence of the particular characteristic on which the 
events were to be classified, and then we needed to be able to count. 
Nothing was said about the relative amounts of the characteristic pos- 
sessed by the events to be ordered. With knowledge of the relative 
amount of the characteristic we are able to increase further the precision 
of our description. 

The Problem. Suppose we are interested in knowing who the best 
player on a baseball team is, or which employee in a factory turns out 
the most work, or which student at a university should be given a special 
scholarship. In these examples we need to go beyond the procedure 
of counting and ordering by frequency. In order to make the judgments 
indicated in these cases we must know more than just which players, or 
employees, or students are in a given class and how many there are in 
this class. We need to have information about the relative amount of the 
characteristic—that is, we need to be able to detect differences in amount 
of the characteristic possessed by the members falling within a class. 

The Procedure. First, for any given problem, we must classify the indi- 
viduals. For example, the students eligible for university fellowships are 
separated from those who are ineligible. As we have already seen, this 
step is prerequisite for any type of ordering. Next we must have informa- 
tion concerning the relative amount of the characteristic. Obviously, the 
characteristic must exist in varying amounts, and we must be able to de- 
tect at least in a rough way the amounts possessed by each individual 
in the class. In our three examples, there must be information about the 
players’ effectiveness in the game, the employees’ production in the firm, 
and the students’ productivity as scholars. 

Before we can arrive at a decision as to who is the best player, or 
most productive employee, or most scholarly student, we must rank some 
of the classified individuals in terms of the amount of the characteristic 
being studied. Actually, for any one of our specific examples we would 
not need to rank every individual who was classified. We could first 
select only the ones having high amounts of the characteristic and rank 
these cases in order from poorest to best. We would then be able to 
discover the person showing the best performance. 

In some problems, we are required to rank all of the individuals in a 
class. In preparing a list of eligible persons for a civil-service job, it is 
necessary to rank all of the eligible applicants from poorest to best. 
Seniority lists for promotions usually require that all eligible candidates
be placed in rank order. In psychological experiments, the rank ordering
of all the participating subjects is usually necessary. In these problems 
reliable information must be obtained on every member of the group 
under study. 

In order to rank individuals in terms of the amount of some
characteristic possessed by them, it is not necessary actually to make a
measurement of the amount of the characteristic possessed by each one. We must
only be able to tell that one individual has more or has less of the
characteristic than another individual. We must be able to state that
A > B > C > D, etc. This judgment demands that we merely be able to observe
differences in amount between individuals; it therefore need not be
based on an actual measurement of these amounts. For example, we
can rank the individuals of a group in order of their heights by
comparing one with another without first measuring the actual height of any
individual.
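
A minimal sketch of ranking from "more or less" judgments alone follows. The ranking routine never sees a measurement; it only asks a comparison function which of two individuals has more. The hidden heights exist solely to simulate an observer standing two people side by side, and all names and values are hypothetical.

```python
from functools import cmp_to_key

# Hidden amounts, used only to simulate the observer's judgment of "more" or "less".
_hidden_height = {"A": 183, "B": 178, "C": 171, "D": 165}

def compare(first, second):
    """Return 1 if first has more, -1 if less, 0 if no difference is seen."""
    a, b = _hidden_height[first], _hidden_height[second]
    return (a > b) - (a < b)

def rank_order(people):
    """Order people from most to least using pairwise comparisons only."""
    return sorted(people, key=cmp_to_key(compare), reverse=True)

print(rank_order(["C", "A", "D", "B"]))  # ['A', 'B', 'C', 'D']
```

The result states that A > B > C > D and nothing more; it carries no information about how far apart any two individuals are.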


Ordering by Relative Amount in Terms of Categories. Sometimes in
working with large numbers of data we find it convenient to classify them
into categories according to relative amount. Each category differs from
those adjacent to it in terms of average amount, but its limits are not
precisely determined in reference to the limits of these adjacent
categories. A good example of this procedure is the letter-grade system of the
schools. For instance, in a given class in history the teacher learns
through class discussions and essay examinations that some students are
better than other students. A crude estimate of the amount of knowledge
of every student is expressed in one of the following categories: A (very
high), B (high), C (average amount), D (low), F (very low). The
categories are placed in rank order, and thus students in different categories
fall at different ranks. The students within a category are not arranged
in rank order. This rough ranking of the students is all that is justified
when exact differences in the amounts of the characteristic are not
available.

Sometimes an ordering of the individuals is made in terms of scores
expressed as per cents. There are two general procedures. One procedure
has few facts to justify its use; the other is supported by sound empirical
tests. The first is exemplified by a teacher who distinguishes the
performance of his students on an essay examination in terms of very small
percentage differences. Perfect performance is given the value of 100, and
the performance of each student is judged in reference to this value. The
examination papers are not merely judged to fall in one of several large
related categories but are distinguished in some absolute sense and
assigned exact per cent values, such as 70, 71, 72, or 92.5, 96.4, 98.8. It has
been demonstrated that examiners are not capable of distinguishing very
small differences in this type of examination performance. Assigning per
cent values that differ by very small amounts gives the appearance of 
attaining a high precision. This is not justified by the facts. 

In the second procedure, a few large categories are defined in terms 
of the proportions of the group that they are to include. For example, in 
grading classes in college the instructor might use the following per cent 
categories: 8 per cent A's, 20 per cent B's, 44 per cent C's, 20 per cent
D's, and 8 per cent F's. In assigning certain proportions of the students
to letter-grade categories it is not argued that we have in any way in- 
creased the precision of our measurements. The number of categories to 
be used should be determined by the accuracy with which the discrimi- 
nations among individuals can be made. 
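
The second procedure can be sketched as follows, assuming the students have already been placed in rank order from best to poorest. The per cent shares are those of the example above; the class of 25 students, the rounding rule at category boundaries, and all names are hypothetical choices of ours.

```python
from collections import Counter

# Shares taken from the example in the text: 8, 20, 44, 20, and 8 per cent.
GRADE_SHARES = [("A", 8), ("B", 20), ("C", 44), ("D", 20), ("F", 8)]

def assign_grades(ranked_students):
    """Give letter grades to students already ordered from best to poorest."""
    n = len(ranked_students)
    grades = {}
    start = 0
    cumulative_share = 0
    for letter, share in GRADE_SHARES:
        cumulative_share += share
        end = round(n * cumulative_share / 100)  # end of this category's slice
        for student in ranked_students[start:end]:
            grades[student] = letter
        start = end
    return grades

# Hypothetical class of 25 students, S1 (best) through S25 (poorest).
ranked = [f"S{i}" for i in range(1, 26)]
grades = assign_grades(ranked)
tally = Counter(grades.values())
print(dict(tally))  # {'A': 2, 'B': 5, 'C': 11, 'D': 5, 'F': 2}
```

Nothing in this assignment sharpens the underlying judgments: two students on either side of a boundary receive different letters even if the difference between them is too small to detect.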

Comparisons Using the Relative Amount Ordering Procedure. When 
we have sufficient knowledge about the amount of the characteristic 
possessed by every individual in a group so that we can place them in 
rank order, we can then make comparisons between each individual and 
every other individual in the group. If our measures are somewhat crude, 
we may not be confident of the accuracy of our estimate of the difference 
between adjacent ranks. That is, we may not feel confident that rank 1 
is better than rank 2, rank 2 better than rank 3, or rank 3 better than 
rank 4. We may still be confident, however, that ranks which are two
or three steps apart are sufficiently accurately assigned to accept the 
difference between them as significant. That is, we may feel confident 
that rank 1 is better than rank 3, or that rank 1 is better than rank 4. 
The confidence we have in our comparisons, whether the individuals are 
one step or more than one step apart, is dependent upon the accuracy of 
judging the relative amounts of the characteristic possessed by the in- 
dividuals. 

When categories are used, we can compare the performance of the 
individuals falling in the different categories. The further apart the cate- 
gories in which the individuals fall, the greater confidence we can have 
in our comparisons. It should be obvious that errors will occur even when 
categories are used. For instance, an individual who is just barely good 
enough in his statistics course to be given a B grade is not much better 
than the individual who barely misses a B grade and is therefore 
given a C. 

Ordering Involving the Measurement of the Amount of Difference in
the Characteristic. In our previous discussion, we were concerned
primarily with judging whether one individual had more of a given
characteristic than did another individual. We were not concerned with the
question of how much more. This question, however, is important in all 
types of scientific measurement. To get an answer to it requires a high 
level of precision in our measuring tools.

The Problem. We have previously observed that individuals in a group
can be ranked in terms of the magnitude of some characteristic without
measuring the amount of the characteristic possessed by each one. We
need to know merely that A > B, B > C, C > D, etc. There is nothing in
this procedure that enables us to tell whether the distances between the
adjacent ranks are of equal size. The difference between A and B might
be many times greater than, or merely a fraction of the size of, the
difference between B and C, or the difference between any other two
adjacent ranks. Greater precision in description results when we know the
relative sizes of the differences between ranks. Even though these differences
are not equal, knowledge of their relative sizes greatly increases the 
accuracy of our comparisons. 

Suppose we have sufficient information about five individuals to be
able to rank them on some trait X as indicated on the line below:

[Line diagram: individuals A, B, C, D, and E marked at unequal intervals
along a line representing trait X.]

Under these conditions, our comparisons of any one with any other one
are more accurate than if we merely know that A > B, B > C, and so on.
Obviously, this greater precision in measurement can be achieved only
with a more accurate estimate of the differences in amount of the
characteristic possessed by the individuals under study.

The Meaning of Amount of Difference. To realize the type of
description now under consideration, it is necessary to measure the amount of
the difference in the characteristic. The question goes beyond that of:
Is A greater than B? to the question: How much is A greater than B?
It should be noted that we are not asking the question: How much is A?
We are concerned here with finding, not the absolute amount of A or the
absolute amount of B, but the best estimate of the absolute amount of
the difference between A and B.

The amount of difference between individuals in terms of a common
characteristic can be realized without knowing the exact amount possessed
by each of the individuals. It is true, of course, that if we know the
exact amounts of a characteristic possessed by two individuals it is a
simple matter to compute the exact difference in amount between the
two. We should note, however, that it is not necessary to know the absolute
amount possessed by each individual in order to measure the amount of
the difference between them. This becomes obvious when we consider a
simple problem in determining the difference in heights of two persons.
We stand them close together and measure from the top of the head of
one to the top of the head of the other. We can state the exact difference
without knowing the position along the height dimension at which either
individual falls.




Comparisons Possible with This Procedure. When we can measure 
amounts of difference in a characteristic, we can compare in a more pre- 
cise manner individuals who differ in the amount of the characteristic. 
Referring to the earlier illustration of the amounts of performance
of five individuals represented as A, B, C, D, and E, it will be noted that 
the differences between adjacent individuals are not the same. The graph 
was prepared so that the distance from A to B is twice that between 
B and C, three times that between C and D, and equal to the distance 
between D and E. Suppose that the performance scores of the five indi- 
viduals were separated by amounts of score differences comparable to the 
linear differences of the illustration. It is then possible for us to make 
several comparative statements about the individuals, such as the 
following: 


The difference in performance between A and B is twice the difference 
between B and C. 

The difference in performance between B and C is one-half greater
than the difference between C and D.

The difference in performance between B and C is half the difference 
between D and E. 


Other similar statements are possible. 
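
The comparisons of differences can be worked through with a small sketch. The positions below are hypothetical scores of our own choosing, picked only so that the gaps agree with the description of the graph: the distance from A to B is twice that from B to C, three times that from C to D, and equal to that from D to E.

```python
from fractions import Fraction

# Hypothetical positions on trait X giving gaps A-B = 6, B-C = 3, C-D = 2, D-E = 6.
positions = {"A": 17, "B": 11, "C": 8, "D": 6, "E": 0}

def gap(scores, first, second):
    """Size of the difference between two individuals."""
    return abs(scores[first] - scores[second])

def gap_ratio(scores, pair_one, pair_two):
    """How many times larger the first gap is than the second."""
    return Fraction(gap(scores, *pair_one), gap(scores, *pair_two))

# The same individuals with every score moved 100 units along the scale.
shifted = {name: value + 100 for name, value in positions.items()}

for scores in (positions, shifted):
    print(
        gap_ratio(scores, ("A", "B"), ("B", "C")),  # 2
        gap_ratio(scores, ("B", "C"), ("C", "D")),  # 3/2
        gap_ratio(scores, ("B", "C"), ("D", "E")),  # 1/2
    )
```

Both lines of output are identical. Statements about the relative sizes of differences hold wherever the scale happens to start, which is why this level of ordering does not require knowledge of absolute amounts.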

Order Involving the Measurement of the Absolute Amount of the 
Characteristic. As indicated in the preceding two sections, ordering of 
the expressions of a variable can be achieved without the measurement 
of the absolute amount of the expressions. The highest level of quantita- 
tive description is obtained, however, when it is possible to measure 
the absolute amount of these expressions. 

The Problem. Sometimes we desire to know more than that one per- 
son’s behavior is superior to another's by a given amount. We want to 
know what the level of superiority of each individual is; that is, we want 
to know how well each person performs. This knowledge requires meas- 
urement of the absolute amount of the characteristic under consideration. 

Suppose we are placed in charge of the production of an assembly 
line in which electrical wall switches are being assembled. It is important 
that we keep account of the absolute number of switches assembled by 
each worker during some constant time interval such as a week. One 
purpose for which such knowledge is needed is for determining the cost 
of manufacture of the switches. This information may be fundamental 
to decisions concerning the effectiveness of the workers and the methods 
of work. 

Type of Comparisons. With knowledge of the absolute amount of
performance, it is possible to compare different individuals in absolute
terms. For example, suppose worker X assembled 75 switches and worker
Y, 50. We not only know that the performance of X is better than that of
Y and that the difference between the two performances is 25 switches,
we also know that the performance of X is 1½ times as great as that of
Y. By measuring every person's performance in absolute terms, any
individual's performance can be taken as a standard and the achievements
of the others expressed in terms of it. This type of ordering can be done
in psychological measurements when we are ordering the actual
performances. When we are concerned with the ability basic to the performances,
we find that procedures are not yet available for expressing the ability
in absolute terms. Further consideration will be given to this problem
in subsequent discussions.

Need of Absolute Zeros. In order to measure in absolute terms we
must have an absolute zero as a point from which to make the
measurements. The nature of and need for zero points in measurement in general
and in psychological measurements in particular are discussed in later
sections of this chapter.
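
The dependence of ratio statements on a true zero can be illustrated with the switch-assembly figures. The counts 75 and 50 come from the example above; the displaced starting point of 25 is a hypothetical value of ours, introduced only to show what happens when scores are not measured from an absolute zero.

```python
from fractions import Fraction

def compare(x_score, y_score):
    """Difference and ratio between two performance scores."""
    return {"difference": x_score - y_score, "ratio": Fraction(x_score, y_score)}

# Counts measured from a true zero (no switches assembled).
true_counts = compare(75, 50)

# The same performances recorded on a scale whose starting point is displaced
# by 25 units (an arbitrary, hypothetical origin).
displaced_counts = compare(75 + 25, 50 + 25)

print(true_counts)       # {'difference': 25, 'ratio': Fraction(3, 2)}
print(displaced_counts)  # {'difference': 25, 'ratio': Fraction(4, 3)}
```

The difference of 25 switches survives the change of starting point, but the ratio does not. Only when the scores are counted from an absolute zero can we say that one performance is one and a half times another.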

Four Levels of Precision in Measurement. It should be apparent that
the precision of the measuring procedure available determines the
accuracy with which we can quantitatively describe the order discovered in the
characteristic. Ordering by frequency requires only classification and
counting. It might be considered a form of ordering by amount, as
amount is represented by the frequency of occurrence of the complete
unit, i.e., event, phenomenon, individual. Ordering by the
amount of some characteristic of the event, phenomenon, or individual
requires measurement. If the procedure of measurement allows us to
differentiate crudely differences in amounts possessed by the members
of a class, we can place them in order of rank. If the procedure of
measurement makes possible an estimate of the amount of the difference
between members of the class, we can perform a more
exact ordering of the relative amounts of the behavior. If the absolute
amounts of the behavior being measured are known, we can
compare individuals by selecting the performance of one person as a standard
and expressing the performances of all other persons in terms of this
standard.


THE USE OF NUMBER MEANINGS IN DESCRIPTION

As discussed in the foregoing sections, measurement is accomplished
by assigning number meanings to the characteristics of an
event. There are several different kinds of number meanings. The
precision of the description is a function of the accuracy with which the
number meanings are assigned, and events differ in respect to the
particular number meanings applicable to them.

Kinds of Number Meanings. Numerical symbols are readily used to
represent different amounts of any behavior characteristic under consid- 
eration. Furthermore, as abstract symbols, they can be subjected to all 
of the manipulatory operations of mathematics. It is not always easy to 
establish that the meanings assigned to the symbols are applicable to 
the characteristic, and this is a point at which errors often are introduced 
in the measurement of psychological variables. 

Assigning the Meaning of Magnitude. Numbers are concepts and are 
characterized by variations in the meaning called magnitude. As we have 
seen, experienced events also vary in this meaning or dimension of mag- 
nitude. We achieve measurement by assigning a number to some char- 
acteristic of an event. The fundamental condition that we must meet is 
that the number must correspond in magnitude with the amount of the 
characteristic. This requires that our knowledge go beyond the mere fact 
that the characteristic varies in amount. We must be able to determine, 
either roughly or with precision, the amount of the characteristic possessed 
by the particular event to be described. Measurement, then, requires spe- 
cial procedures by which we can determine the magnitude of the char- 
acteristic that we desire to describe quantitatively. We then assign to the 
characteristic a number that corresponds with this magnitude. 

Assigning Complex Number Meanings. Besides the meaning of magni- 
tude, numbers have other characteristics that allow us to manipulate them 
in the familiar ways of addition, subtraction, multiplication, and division. 
Under certain conditions, these more complex meanings can also be ap- 
plied to the particular psychological event being described. This is what 
we do in a spelling test when we subtract 50 words spelled correctly from 
75 words tried in order to learn the number of words missed. 

Of course, it is not possible to submit every event and its character- 
istics to all of these operations. There is, then, the problem of determin- 
ing and interpreting the relationship between the meanings of the numer- 
ical symbols and the meanings of the events. 

Necessary Conditions for Assigning Number Meanings. In order to use 
the meanings of numbers accurately in describing events, it is necessary 
that the events be of such a nature that the particular number meanings 
are appropriate. Some meaning that is assignable to the events must be 
congruent with the number meaning that we desire to apply. 

A homely illustration involving the meaning of addition should help 
make this point clear. If 12 oranges are added to 12 apples, the answer is 
not 24 apples, nor 24 oranges, nor 24 apple-oranges. The meaning must 
correspond to a characteristic that fits both oranges and apples, such as 
the meanings “pieces of fruit” or “object” or “thing,” e.g., 24 objects. 
If a meaning peculiar to each fruit is retained, then the description must 
contain the conjunction “and,” e.g., 24 oranges and apples. In either
instance, the meaning assigned the totality suffers a reduction in precision
because the specific meaning of 12 assigned to both oranges and apples 
before the addition is performed is not either implicit or explicit in the 
meaning assigned the totality. In other words, the phrase “24 oranges 
and apples” does not include the meaning 12 of one and 12 of the other. 
Twenty-four oranges and apples means combinations of oranges and 
apples in any numbers that will add to 24, as, for example, 1 orange and 
23 apples. Although this point seems obvious and may appear inconse- 
quential in the example given, in the application of measurement to 
psychological phenomena a real and significant problem is involved. 

Advantages from Assigning Number Meanings. Singleness of Meaning. 
In general, numbers have singleness of meaning; that is, there is little 
disagreement in the meaning assigned a number by different persons. 
Freedom from ambiguity in meaning is one of the necessary conditions 
for accurate description. With singleness of meaning established in the 
number series, our task of achieving a high fidelity of correspondence be- 
tween the numerical meanings and the event meanings is made much 
easier. Our attention can be focused primarily upon obtaining singleness 
of meaning in the magnitudinal aspects of the events. 

Increase in Precision. Numbers are peculiarly appropriate as symbols 
for representing different amounts of a variable. The fundamental mean- 
ing of numbers being that of magnitude, differences in the meaning of 
numbers represent differences in magnitude. Variations in amount of any 
event characteristic can be represented by numbers varying in size. In 
this way a closer degree of correspondence is achieved between the event 
characteristic and the conceptual descriptive symbol than can ever be 
attained by the use of nonnumerical word descriptions. 

High levels of precision are obtainable with numbers because of the
fine gradations of magnitude possible with numbers. Gradations in magni-
tude in the number series can be as large or as small as the needs of the
event demand. Whether the amount of the event gets progressively
smaller or progressively larger, we can find a number that has a corre-
sponding meaning of magnitude.
Universal Application. Lastly, the abstract nature of numbers gives
them universal application. As adjectives, they can be applied to any
measuring unit. They can be used to modify units of speed, units of
test performance; in short, any unit of measurement.

Assigning Number Meanings to Psychological Variables. As we learned
earlier, there are several conditions that must be met before we can as-
sign quantitative meanings. Number meanings can be assigned to a vari-
able only when the characteristic of the variable under consideration
exists in differing amounts and when these different amounts can be
readily identified. Accurate description also requires measuring proce-
dures by which the amounts of the characteristic can be estimated. 
Lastly, only when there is some meaningful correspondence between the 
number meanings being applied and the meanings of the characteristic 
being described can we claim that valid measurement has been achieved. 

It is worthwhile for us to examine ways of interpreting the correspond- 
ence required between the meanings of the number symbols and the 
meanings of the observed psychological variable. 

Ignoring the Meanings of the Psychological Variable. According to 
this point of view, number meanings are assigned to the scores or meas- 
ures of a variable without due regard for the empirical meanings of the 
variable. Following the assignment of the number meanings to the 
scores, the mathematical manipulations then proceed without further 
reference to the empirical meanings. To illustrate this interpretation, let 
us consider an imaginary set of scores. Suppose that we have collected 
performance measures on the following variables: mental age, height, 
reaction time, reasoning ability, and memory for names. We correlate 
the scores of each variable with mental age and get fairly low relation- 
ships. Then we continue trying out many different combinations of the 
scores until we find that if we add height scores to memory for names 
and divide by the reaction time we get a measure that correlates rather 
highly with mental age. We then proceed to compute such measures and 
use them for predicting the mental ages of children. It should be appar-
ent that the empirical meanings of the several test scores are lost sight
of in the processes of analyzing and combining in order to get a com- 
posite score that correlates significantly with mental age. Adding height 
to memory for names and dividing by reaction time does not appear to 
have a corresponding meaning in the psychological world. 

It is difficult to conceive of any scientist endorsing this point of view.
Even the theoretical physicist, who has developed the application of
mathematics to scientific subject matter to the highest degree yet achieved,
implies in his discourse that he is talking about nature. He is not merely 
referring to the abstract meanings that he has developed through ra- 
tional manipulation of mathematical symbols. 

Requiring Meanings That Are Mathematically Manipulable. This in- 
terpretation states that we should apply number meanings to only those
characteristics of a variable that can be sensibly manipulated as required
by the particular mathematical process to be applied. Certainly, mean-
ings obtained through the application of a mathematical equation may 
or may not be applicable to the empirical characteristics of the variables 
from which the function and the equation were developed. Symbols in 
an equation can be manipulated in ways that cannot conceivably be 
applied to the original psychological data. 

Suppose we have scores on two psychological variables to which we
wish to apply the arithmetic process of addition. The two variables 
should have a common meaning that can be subjected to this arithmetic 
manipulation, or the final outcome may be either erroneous or absurd in 
meaning, or both. Let us consider an illustration. John’s spelling ability 
is measured by his response in correctly spelling 90 words out of 95. His 
arithmetic ability is measured by his correct solving of 80 problems in 85. 
We now average his two scores and conclude that his average perform- 
ance in the two subjects is 85, i.e., (90 + 80)/2 = 85. 

Obviously the figure 85 does not apply either to arithmetic or to spell- 
ing, but to some new meaning that is found common to both of these 
factors. It is not apparent just what this new meaning is. It is difficult 
to say what is done when spelling ability is added to arithmetic ability 
and an average computed. To say that 85 is an average of the two abilities 
is absurd. Eighty-five is merely the average of the numbers 90 and 80, 
and does not tell us anything about the nature of the magnitude in the 
two variables presumably being assessed in the number 85. 
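
The arithmetic in this illustration can be laid out as a short sketch. The counts are those given above. The second computation, which converts each count to a proportion of items attempted, is an assumption added here to show one possible common meaning. It is not a procedure prescribed in the text, and the variable names are hypothetical.

```python
# Scores taken from the illustration: items correct and items attempted.
spelling_correct, spelling_total = 90, 95
arithmetic_correct, arithmetic_total = 80, 85

# The computation criticized in the text: a mean of two raw counts.
raw_average = (spelling_correct + arithmetic_correct) / 2

# Assumed alternative: give both scores one shared meaning,
# "proportion of attempted items answered correctly", before combining.
spelling_prop = spelling_correct / spelling_total
arithmetic_prop = arithmetic_correct / arithmetic_total
proportion_average = (spelling_prop + arithmetic_prop) / 2

print(raw_average)
print(round(spelling_prop, 3), round(arithmetic_prop, 3))
print(round(proportion_average, 3))
```

Even the proportion only states what share of the items was answered correctly. Whether it describes a magnitude common to the two abilities remains an empirical question.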

Using Predictive Propositions for Discovering Meanings. One way of 
applying number meanings is to form propositions stating functional 
relationships between psychological variables, and to use these funda-
mental relationships in interpreting new situations. Such a proposition
may contain conceptual elements, but these elements stem directly from
the empirical facts. The proposition describes a probable functional re-
lationship, and is expressed by means of numerical symbols. The func-
tion, in the form of some numerical equation, is then applied to a new
situation. The meanings of the terms of the predictive proposition, how-
ever, must in some sensible way be applicable to the variables. If this is
not so the predictions cannot be verified, and therefore they serve no
purpose.

Success in verifying predictive propositions leads us to believe that
they do express meanings that are useful in interpreting and controlling
psychological variables. This is not to say that every meaning applicable
to the numerical symbols in a functional equation is also applicable to
the behavior being analyzed. Some meanings of symbols cannot be di-
rectly applied to psychological variables.

Let us consider an example. Suppose again that we have the scores
of persons on several mental-abilities tests. We cannot actually add the
abilities of any given person. We can, however, add his ability scores,
combining them in some statistical way in order to obtain one measure
for all abilities. The importance of this new meaning, which cannot be
directly observed in the psychological variables being described, derives
from the fact that it enables us to discover further meanings that are
useful. The averaging of the scores from the several ability tests given
an individual provides us with a new meaning that we can call the
individual’s general ability. We are unable to find behavior processes 
that correspond to this general ability. Despite this, however, we can 
make predictive use of the individual's general-ability score. It can be 
used to improve our understanding and control of the individual's activi- 
ties. Thus we gain from the information provided by the general-ability 
score even though we have not found any empirical processes in the 
individual to which it can be directly referred or with which it closely 
corresponds. 

Limitations in Assigning Number Meanings to Psychological Variables. 
Psychological variables are difficult to measure. Human behavior has 
stubbornly resisted measurement. The quantitative aspects of complex 
behaviors are not readily identified, and procedures of measurement 
applicable to global types of behavior are difficult to devise. Most psy- 
chological measuring instruments now available are sensitive to changes 
coming from without the variable being measured, such as changes in 
the sample studied, in the person doing the measurement, or in the 
mode of applying the measuring instrument. Psychological measurement 
procedures often lack the stability that characterizes physical measure- 
ment procedures. These limitations are serious handicaps in applying 
mathematical equations to functional relationships in behavior. Not all 
psychologists are aware of them, and some of those who are aware of 
them still continue to apply their measuring techniques uncritically. 

Particularly must the psychologist be on guard against committing the 
error of assigning inapplicable number meanings to his variables. Unless 
he knows from the empirical evidence what the behavior processes are to 
which his numerical symbols refer, the value of his mathematical equa- 
tions is to be questioned. Under these circumstances, statistical manipu- 
lations applied to his data are suspect. One caution he can observe is to 
place less confidence in meanings that are abstracted so far from natural 
behavior processes that he cannot readily return to empirical situations to 
check them. A second caution he can observe is to refrain from using 
new meanings derived from mathematical equations until he can con-
duct empirical tests to verify them. Demonstration of the applicability of
the new meanings to behavior processes is a necessary condition of their
acceptance.


CONDITIONS NECESSARY FOR ACCURATE MEASUREMENT 


To measure an object, we apply some unit of measurement against the 
object and determine how many times the unit is contained in the object. 
The point at which we start applying the measuring unit is at “no
amount,” and the point at which we discontinue applying the unit is at
“all amount.” The amount of the object is represented as the number of 
units needed to go from the one point to the other. The two necessary 
conditions for measurement, then, are a unit of measurement and a zero 
point. 

Units of Measurement. Units in “The Thing Itself.” In many common
measuring situations, the unit of measurement consists of a small amount
of the thing that is to be measured. For example, to measure length we
use a small amount of length, such as the inch, or the foot, or the yard.
To measure time we use a small amount of time, such as the second, or
the minute, or the hour. In these situations, measurement is accurate be-
cause the amount of the thing being estimated is expressed in terms of
identically the same kind of phenomenon as itself.

Units in a Functionally Related Variable. Measurement is possible and 
can be done very accurately by using a unit of measurement that is not 
a small amount of the characteristic being measured but a small amount 
of some other variable which bears a functional relationship with this 
characteristic.

In the measurement of weight we have an example that will make
this clear. Weight can be measured either by a unit that is a small amount
of the thing itself or by a unit in another variable bearing a constant
functional relationship with the weight variable. When a balance scale
is used, the object to be weighed is placed in a pan at one end of a lever
and small weights are placed in a pan at the opposite end. Here the
unit is a small amount of weight. In a spring scale, the weighing pan is
connected through a lever to a spring. Placing the object to be weighed
in the pan stretches the spring, and a pointer that is fastened to the
spring is made to move along a printed scale of numbers indicating
differences in the amount of weight. In the spring scale, the unit of meas-
urement is a unit of length. A functional relationship is established be-
tween the amount of weight and the linear extension or “stretch” of
the spring.
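
The spring-scale case can be sketched in a few lines. The sketch assumes that the stretch of the spring is directly proportional to the weight in the pan, which is one simple form the functional relationship might take. The calibration values and function names are hypothetical.

```python
def calibrate(known_weight, observed_extension):
    """Return weight per unit of extension, assuming a linear spring."""
    return known_weight / observed_extension


def weight_from_extension(extension, weight_per_unit):
    """Convert a measured length (the stretch) into an amount of weight."""
    return extension * weight_per_unit


# Hypothetical calibration: a known 10-pound weight stretches the spring 2 inches.
weight_per_inch = calibrate(known_weight=10.0, observed_extension=2.0)

# An unknown object stretches the spring 3 inches.
estimated_weight = weight_from_extension(3.0, weight_per_inch)
print(estimated_weight)  # 15.0
```

The quantity actually read is a length. It is interpreted as weight only through the calibrated relationship.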

Thus, to measure weight it is not ne

sion on the rule in the process of using it. Measurements made at differ-
ent times with this rule would not be equal but would vary slightly ac- 
cording to the changes in tension applied to it. 

Physical measuring units are sufficiently constant for most purposes 
of measurement. Although a steel rule will change in length with changes 
in temperature, these changes are usually so small as to be of no conse- 
quence in most situations in which steel rules are used. If we were con- 
cerned with measuring to an accuracy of a millionth of an inch, we 
would need to control temperature variations. The amount of change in 
the size of the unit of measurement that can be tolerated is, then, a func- 
tion of the purpose for which the measurement is being made. 

Zero Points. The Need of Stable Zero Points. Measurement involves 
comparisons. For accurate comparisons we must have some point to 
which we can refer our measurements. These comparisons may be be- 
tween different measures on the same individual or group, or between 
different individuals or groups in terms of the same measures. 

We are not justified in comparing two objects or processes if they 
are not measured from the same stable beginning point. Suppose we 
have two tape rulers, one of which is worn away on the zero end so it 
begins at the 14-inch mark. If we were to use the two rulers in measur- 
ing respectively the heights of two persons, these height measurements 
would not be referred to the same zero point. Again, if in measuring 
the height of two people we were to allow one to wear shoes and the other 
to stand in bare feet, we would not be measuring from the same point 
in the height dimension and our comparison would be in error. 

We are not justified in making comparisons of measures taken at 
two different times if the reference point changes its position between 
the two measurements. Suppose we introduce some change in an experi- 
mental variable between two measurements, and at the same time there 
occurs a shift in the zero point of the measurements. Any difference ob- 
tained between the measures taken on the two occasions could not then 
be attributed solely to the change in the experimental variable because, in 
part, it would be due to this shift in position of the reference point. What 
is needed in all measurement is a stable zero point. 
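
A small numerical sketch shows how a shift in the zero point becomes confounded with an experimental change. All values are hypothetical. The sketch also assumes that a shifted zero simply adds a constant to every reading.

```python
def reading(true_amount, zero_shift):
    """Reading obtained when the reference point is displaced by zero_shift."""
    return true_amount + zero_shift


# First occasion: the zero point is where it should be.
before = reading(true_amount=50.0, zero_shift=0.0)

# Second occasion: the experimental variable raised the true amount by 3,
# but the zero point also moved by 2 units between the two measurements.
after = reading(true_amount=53.0, zero_shift=2.0)

observed_difference = after - before                     # 5.0
true_change = 53.0 - 50.0                                # 3.0
due_to_zero_shift = observed_difference - true_change    # 2.0
print(observed_difference, true_change, due_to_zero_shift)
```

From the readings alone, the investigator sees only the difference of 5 and cannot separate the two components.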

Zero as “No Amount of the Thing.” In everyday trade situations, meas- 
urement of objects usually begins at a point that represents no amount 
of the characteristic being measured. In such items as 2 pounds of meat, 
an 8-foot table, a charge of $20, etc., the measuring process is begun at a
point of no amount of the characteristic involved. This type of zero is
often called the absolute zero because there is no possibility of getting a
value smaller. It should go without saying that the value of any given 
absolute zero is a constant, and therefore all measurements involving such 
a zero will be started from exactly the same reference point. 

Zero as a Defined Stable Point. Situations frequently arise in which
we wish to measure the amount of a characteristic but do not have an
absolute zero from which to begin the measurement. The scale of hard-
ness, which consists of 10 minerals varying in their hardness, does not
have an absolute zero. The least hard mineral is talc, with a value of 1,
and the mineral of greatest hardness is diamond, with a value of 10.
Talc can be used as the point from which to begin measurement because
under a given set of conditions its hardness remains constant. As a refer-
ence point, the hardness of talc is a stable zero point.

Consider the measurement of the psychological characteristic called 
intelligence. No usable absolute zero has been found for this character-
istic. Although we can define an absolute zero as no amount of intelli-
gence, we cannot devise a practical situation wherein measurement will
begin from this zero. In this type of situation we can resort to a statisti-
cally defined relative zero, such as the mean of a set of scores. It is im-
portant that such a relative zero be stable if the ensuing measurements
are to be accurate enough for comparative purposes. 

Measurement with a Functionally Related Unit of Measurement and
a Relative Zero. Accurate measurement is possible without units that
are a small amount of the characteristic being measured and without
zeros that represent no amount of this characteristic. We can escape
the first difficulty if we find another characteristic with magnitudinal
variations that bear a constant functional relationship with the magni-
tudinal changes in the characteristic to be measured. We can avoid
the second difficulty by using a stable relative zero in place of an abso-
lute zero.

When thermometers were first devised, the absolute zero of temperature was
not known. An arbitrary set of conditions was adopted for defining a 
relative zero. On the centigrade scale, zero temperature is represented 
as that temperature at which pure ice melts at sea level. It will be noted 
that three special prescriptions are made, namely, melting point, pure 
ice, and sea level. Constancy or stability of the zero point is achieved 
by meeting these three prescriptions. 

The unit of temperature measurement, the degree, is also arbitrarily 
defined. In the centigrade scale, a second point on the scale was estab- 
lished as that temperature at which pure water boils at sea level. This 
temperature was given the value of 100. The degree was then defined as
1/100 of the distance between this point and zero temperature.

Measurement of temperature, then, is obtained by starting the meas- 
uring process from a relative zero and by using a unit in a wholly differ- 
ent characteristic than temperature itself. 
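
The definition of the centigrade degree can be sketched as a computation. The sketch assumes an instrument in which temperature is read as the length of a liquid column that changes evenly with temperature. The column lengths and the function name are hypothetical.

```python
def centigrade_from_length(length, length_at_ice, length_at_boiling):
    """Convert a column length to degrees centigrade.

    The relative zero is the column length at the melting point of pure ice;
    one degree is 1/100 of the distance between that mark and the mark for
    boiling water.
    """
    degree = (length_at_boiling - length_at_ice) / 100
    return (length - length_at_ice) / degree


# Hypothetical calibration marks, in millimeters of column length.
ICE_MARK = 20.0
BOILING_MARK = 220.0

print(centigrade_from_length(70.0, ICE_MARK, BOILING_MARK))   # 25.0
print(centigrade_from_length(20.0, ICE_MARK, BOILING_MARK))   # 0.0
print(centigrade_from_length(220.0, ICE_MARK, BOILING_MARK))  # 100.0
```

Both the zero and the unit are expressed in length, a characteristic wholly different from temperature itself.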


UNITS OF MEASUREMENT IN PSYCHOLOGICAL ANALYSES 


Kinds of Units. Physical Units. Physical units are those that have been 
developed in connection with the description of physical phenomena. 
Many of these units are essential to the work of the psychologist, such 
for example as units of time, length, weight, voltage, force, etc. Because 
most behavior expressions have a durational characteristic, the unit of 
the time dimension particularly proves of inestimable value. 

Psychological Units. Psychological units consist of small amounts or
divisions of behavior or performance, and reflect either positive and
effective or negative and ineffective response. The following are some
examples: completing a trial in a maze, typing a page of text, failing to 
work a mathematical problem, constructing some object such as a picture 
puzzle, misspelling a word, failing to accomplish a given task in the 
allotted time, and the like. 

Psychological Units as a Small Amount of “The Thing Itself.” It is some-
times possible to utilize as the unit of measurement a small amount of
the psychological characteristic being assessed. In measuring the ability
to add, the unit used is a problem in addition. In measuring the ability
to spell, the unit used is a word spelled. Here we are defining the abilities 
to add and to spell in terms of the characteristics elicited in the perform- 
ance of individuals when they are solving problems and spelling words. 

It may be argued that by the phrase “ability to add” we mean more 
than merely the characteristics manifest in working problems; that there 
is something over and above the performance that we wish to call “the 
ability.” This is more clearly seen in the measuring of the variable of in-
telligence. In the measurement of intelligence, we use such items as
analogies, arithmetic problems, identification of objects, detection of rela-
tions, number completion, interpretation of proverbs, arithmetic reason- 
ing, and others. The argument is made that intelligence is reflected in 
every one of these types of items and therefore should not be identified 
with the psychological processes involved in the solution of any one of 
them. Obviously, the units differ from one type of item to another, so 
there is really no single unit of measurement. It cannot be construed that 
these units are small amounts of the ability called intelligence. When the 
characteristic is defined in terms of the objective manifestations of per- 
formance, however, it is possible to interpret the problems or exercises 
used as small amounts of the characteristic being measured and to use 
the problems or exercises as units of measurement. 

Units of a Functionally Related Variable. When no unit in the char- 
acteristic can be found, it is sometimes possible to devise a unit in an- 
other characteristic related to the characteristic that we wish to measure. 
A functional relationship is then the basis of measurement, and it is im-
portant that there be a known and close correspondence between the
changes in magnitude of the original variable to be described and the 
changes in the values of the adopted variable through which measure- 
level of proficiency. In none of these instances is the variable in which
we are interested being measured directly. Rather, we measure changes 
in the variable in terms of differing amounts of another variable the 
values of which are positively related with those of the first variable. 

The Problem of Equality of Units. Physical Units. When we use physi- 
cal units for measuring psychological variables, we often escape the 
problem of inequality of units. Obviously, this is true when we measure 
physical characteristics of individuals such as height and weight. It may 
also be true when the characteristic being measured is psychological in 
nature. In the latter case, the ability is defined in terms of concepts for 
which we have objective counterparts. The measurement of reaction time 
will be used as an illustration. 

Reaction time is defined as an ability possessed by the individual. Its 
primary meaning is that of duration. This meaning is also the most im- 
portant meaning of the time dimension, and we have invented many 
measuring instruments for assessing differences in duration. In a situation 
in which the individual, through his performance, can objectively express 
his reaction-time ability, we can utilize one of the time instruments and 
accurately assess this durational meaning of his reaction time. For ex- 
ample, the braking-reaction time of an automobile driver is measured by 
the time required in removing the foot from the accelerator pedal and 
applying it to the brake pedal. 

Psychological Units. In the measurement of psychological variables, 
quantitative description at low levels of precision can be attained without 
equal units of measurement. Ordering by means of frequency and rank- 
ing can be achieved without equality of the units. If the ranks can be 
accepted as being approximately equivalent, more exact descriptions 
are made possible. 

When we use such units as a problem worked, a word spelled, an error 
committed, and the like, the tacit assumption is made that within a given 
instrument any unit is equal to every other unit. Obviously, this assump- 
tion is not true. For example, in arithmetic tests, differences between 
problems will be smaller when we use a large number of simple prob- 
lems than when we use a small number of complex problems. In other 
words, equality of units is approached as the problems are made simple 
in nature. This is one of the common procedures used in obtaining equal 
units. It is severely restricted in application, however, being usable with 
only very simple psychological processes.

Another procedure suggested for achieving equality of units in psycho-
logical tests is to determine empirically the difficulty of the items and
then use only those items that are known to be equal in difficulty. Thus,
for an arithmetic test, the items would be given to a group of indi-
viduals comparable to those for whom the test is being devised. The
difficulty of each item would be determined in terms of the proportion 
answering it correctly. The test would then be constructed of items of 
approximately the same difficulty. 
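
The selection of items by empirical difficulty can be sketched as follows. The response data are invented for illustration, with 1 marking a correct answer and 0 an incorrect one. The target difficulty and the tolerance are likewise assumptions, since the text does not say how close "approximately the same" must be.

```python
# Hypothetical tryout data: each item was answered by the same ten persons.
responses = {
    "item_a": [1, 1, 1, 0, 1, 1, 0, 1, 1, 1],
    "item_b": [1, 0, 1, 0, 1, 1, 0, 1, 0, 1],
    "item_c": [1, 0, 1, 1, 0, 1, 0, 1, 1, 0],
    "item_d": [0, 0, 1, 0, 0, 0, 1, 0, 0, 0],
    "item_e": [1, 1, 0, 0, 1, 0, 1, 1, 0, 0],
}


def difficulty(item_responses):
    """Difficulty index: the proportion of persons answering correctly."""
    return sum(item_responses) / len(item_responses)


def select_items(all_responses, target, tolerance):
    """Keep only the items whose difficulty lies within tolerance of target."""
    return [
        name
        for name, item_responses in all_responses.items()
        if abs(difficulty(item_responses) - target) <= tolerance
    ]


selected = select_items(responses, target=0.6, tolerance=0.15)
print({name: difficulty(r) for name, r in responses.items()})
print(selected)
```

With these data the very easy item (item_a) and the very hard item (item_d) are set aside, and the remaining three items form the test.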

The foregoing procedures for achieving equal units of measurement
are not applicable to many important psychological variables, particularly
to global or molar behaviors. To overcome this deficiency, the psycholo-
gist has turned to statistical procedures for achieving equal units. Brief con-
sideration is given here to one of these statistical procedures. A
more complete discussion of the mathematical aspects of the procedure
will be found in Chap. 10.
The Standard Deviation as a Unit of Measurement. One of the recurrent
problems in psychology is the measurement of differences in performance
both within individuals and between individuals within a group. The 
standard deviation serves as an accurate index for measuring these differ- 
ences, and so it serves as a unit of measurement for evaluating perform- 
ance. The measurement of the performance of persons within a group 
will be used to illustrate the serviceability of this unit. 

When many persons are measured by means of some test or other in-
strument, the performance scores vary in value, there being a striking
tendency for scores of medium value to occur more frequently and scores
of extreme value to occur less frequently. A definite relation-
ship is found between the frequency of occurrence and the size of the
score, a mathematical function occurring that is similar to the one dia-
grammed in Fig. 1 of Chap. 5. There is a mathematical formula available
for expressing this relationship, and the standard deviation, which is
part of this formula, is used to measure the amount of dispersion or spread
occurring among the scores. It is possible to determine the distance that
separates any score from any other score and to express this distance in
terms of the number of standard-deviation units. It is also possible to
express in standard-deviation units the distance separating any given
score from the mean of the scores.
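
Expressing distances in standard-deviation units can be sketched with a small set of invented scores. The sketch uses the standard deviation computed over the whole group (the population form). That choice is an assumption, since the text leaves the formula to Chap. 10.

```python
import statistics

# Hypothetical test scores for a group of eight persons.
scores = [2, 4, 4, 4, 5, 5, 7, 9]

mean = statistics.mean(scores)   # 5
sd = statistics.pstdev(scores)   # 2.0, standard deviation over the whole group


def in_sd_units(score, reference):
    """Distance from reference to score, expressed in standard-deviation units."""
    return (score - reference) / sd


# Distance of one score from the mean of the scores.
print(in_sd_units(9, mean))   # 2.0

# Distance separating one score from another score.
print(in_sd_units(9, 7))      # 1.0
```

A score of 9 thus lies two standard-deviation units above the mean and one unit above a score of 7.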
in which it is used. We have already learned that one of the important
characteristics of a unit of measurement is that it have a constant value. 
We have, then, in the standard-deviation unit a constant unit of meas- 
ance only. Because of its almost universal applicability, the standard-
deviation unit has proved the most useful unit of measurement yet de- 
vised for evaluating human behavior. 


ZERO POINTS IN PSYCHOLOGICAL MEASUREMENTS 


It will be remembered that accurate measurement is made possible 
when we have equal units of measurement and stable zero points. When 
we meet these two requirements we can order either individuals or 
groups on a continuum representing the characteristic being measured. We 
have the basis for making accurate comparisons among individuals or 
among groups. 


Psychology’s Need for Zero Points. In the s 
need stable relative zero points. Our primary concern is with comparing 
the performances of different individuals and groups, or with comparing 
two or more different performances of the same individual or group. 
In each of these tasks we are concerned with the quantity of a given per- 
formance in relation to the quantity of another performance. We are not 
concerned with the quantity of a given performance in relation to “no 
performance at all,” i.e., absolute zero. This is to say that, in the main,
the needs of psychological measurements are satisfied when the two 
performances are comparable. 

The two conditions that must be realized to make two or more measure- 
ments comparable are those already mentioned, namely, equal units of 
measurement and common stable zero points. Given equal units, relative 
zeros—if common and stable—are then sufficient to accomplish the major-
ity of comparisons involved in the quantitative evaluation of human be-
havior.


The Need for Common Zero Points. By “the need for a common zero
point” we mean that the point must have the same meaning for measure-

being described. No prob-

If we gave the group a difficult physiology test and an easy psychology
test, we would not be justified in assuming that the zero points on the 
two tests were comparable. 

In this problematic situation the psychologist has appealed to statisti- 
cal procedures for achieving a solution. A stable zero point in each sub- 
ject is statistically feasible. The mean score is most frequently utilized. 
We must be assured, however, that the mean scores are not subject to 
large sampling errors and that they are comparable in the sense of being 
at approximately identical points on the continua of the two abilities. 
This comparability is attained by randomly selecting the subjects from 
appropriately defined populations and by utilizing large numbers of sub- 
jects. In our example, if the group is representative of college sophomores 
in so far as their knowledge of physiology is concerned and if, similarly, 
they are representative of college sophomores in so far as their knowl- 
edge of psychology is concerned, then with a large number of cases enter- 
ing the computations we should obtain mean scores that are equivalent
on the two continua.

Characteristics of Relative Zeros. A relative zero should have the char-
acteristic of stability mentioned in previous discussions. Furthermore,
if comparisons of two or more variables are to be made, then the relative
zeros in the several variables should have a common meaning pertinent
to the purpose of measurement as described in the foregoing section.
Two additional characteristics of a relative zero are important. It
should be rigorously defined, and it should be relatively easy to deter-
mine. By being rigorously defined we mean that the zero point should be
so described that different investigators are able independently to 
arrive at its value. This can be accomplished if the zero point is stated 
algebraically in terms of a mathematical formula. Ease of determination 
is recommended as a characteristic in order that the zero point will re- 


ceive wide use in scientific investigations. 
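
As a rough illustration of a relative zero stated as a formula, the
following Python sketch takes the group mean as the zero point and
re-expresses each raw score as a deviation from it. The scores and the
function name deviation_scores are hypothetical, chosen only to show the
arithmetic. The sketch also assumes, as the text requires, that the
group is large and representative on both abilities; with five invented
cases it demonstrates the computation and nothing more.

```python
from statistics import mean

def deviation_scores(scores):
    """Express each raw score as a deviation from the group mean."""
    zero = mean(scores)  # the relative zero, defined by a formula
    return [score - zero for score in scores]

# Hypothetical raw scores for the same five students on two tests.
physiology = [12, 15, 18, 21, 24]  # difficult test, low raw scores
psychology = [62, 65, 68, 71, 74]  # easy test, high raw scores

physiology_dev = deviation_scores(physiology)
psychology_dev = deviation_scores(psychology)

# The raw scores differ by 50 points, yet the deviations coincide.
print(physiology_dev)
print(psychology_dev)
```

Because the zero is defined by a formula, any investigator given the
same scores would arrive independently at the same value, which is the
sense of "rigorously defined" used above.
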
The mean score, when determined from a 
conceptualize an absolute zero, as was previously pointed out in regard
to the ability called intelligence. But for such variables no practical 
situations have been discovered by which such conceptualized zeros can 
be empirically demonstrated in performance situations. In the case of 
some psychological variables, absolute zeros cannot even be sensibly 
conceptualized. What would be the meaning of an absolute zero in per- 
sonality or of an absolute zero in social behavior? It is difficult to con- 
ceive of a person without personality or of one who does not manifest 
social behavior either adjustive or unadjustive in nature.

In most measurement situations in psychology we can evolve meanings 
that are restricted to the objective performance that provides the meas- 
ures, or we can rationalize beyond these meanings to conceptualized 
characteristics that we assume underlie the performance. If we restrict 
ourselves to a consideration of performance per se, we can apply an 
absolute zero, defining it as no performance or zero performance on the 
particular measuring instrument being used. In this instance we can say 
that a performance of 40 is twice as great as a performance of 20. 

In many problems, the nature of the conceptualized ability is the goal 
that is sought. We then utilize the performance scores to represent the 
conceptualized ability considered basic to the performance. In this in- 
stance there is no absolute zero. If we wish to compare differences in 
absolute amounts of this ability by means of differences found in the 
empirical scores, we are not justified in making comparative statements 
that involve the number meanings of multiplication or division. We must 
conclude that in so far as human abilities are concerned we are not justi- 
fied in our comparisons of individuals or of groups to say that one score 
represents a given number of times the amount of ability represented 
by another score. 
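
The arithmetic behind this restriction can be sketched in Python. The
scores 40 and 20 come from the text; the location of the shifted zero
(10) and the function names are assumptions made purely for
illustration. Differences between scores are unaffected by where the
zero is placed, but ratios are not, which is why a statement such as
"twice as much" is meaningful only when the zero is absolute.

```python
def difference(a, b, zero=0):
    """Difference between two scores measured from a given zero point."""
    return (a - zero) - (b - zero)

def ratio(a, b, zero=0):
    """Ratio of two scores measured from a given zero point."""
    return (a - zero) / (b - zero)

# Performance per se: zero performance serves as an absolute zero.
print(ratio(40, 20))           # 40 is twice 20

# The same scores referred to a hypothetical zero located at 10.
print(ratio(40, 20, zero=10))  # the ratio is no longer 2

# Moving the zero leaves the difference between the scores unchanged.
print(difference(40, 20), difference(40, 20, zero=10))
```
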

Measurement of psychological variables is handicapped very little,
however, by the absence of absolute zero points. As noted in earlier
sections, accurate measurement can be accomplished with stable relative
zeros. Common stable zeros as points of reference are sufficient for
achieving precise quantitative comparisons.


THE QUANTIFICATION OF CONCEPTUALIZED ABILITIES 


Several times we have had occasion to refer to the abilities or
capacities underlying human behavior. Conceptual abilities are
postulated by which we can explain the objective performance. When
these conceptual abilities are supported by sufficient evidence we
accept them as facts and proceed to work out their individual
characteristics.

The Basis for Postulating Conceptual Abilities. The logic of postulating
abilities underlying behavior is not difficult to understand. We observe
quality and quantity of performance in different
a ame or very similar sets of stimulating conditions. 
Inasmuch as the environmental stimuli eliciting the behavior are ap- 

s in the psychological 
make-up of the individuals. 

Facts point to the existence of stable characteristics in the make-up 
of every individual. If we repeatedly observe the behavior of a given 
individual we discover that certain qualities and combinations of qualities 
are frequently manifest in his performance. We therefore postulate one 
or more abilities as being responsible for these stable characteristics. We 
then devise names by which we can distinguish these abilities. 

The Logic of Measuring Conceptualized Abilities. The logic of measuring
conceptualized abilities is straightforward and simple. The qualitative
and quantitative characteristics of different individuals are stable and
occur frequently in the form of overt response patterns. To describe
these responses accurately, we devise stimulating situations that will
consistently elicit these stable performance patterns from the
individual at different times. To introduce quantity into our
descriptions, we select those aspects of the performances that are
amenable to the assignation of number meanings. If, when tried out
empirically, such stimulating situations consistently elicit the same
quantitative and qualitative characteristics in performance, we infer
that we have measured that stable characteristic in the individual that
we have called his ability.

The Nature of the Relationship between Performance and Ability. The
interpretation of the nature of the relationship between the
performance and the postulated ability is one fraught with difficulty
and misunderstanding. What we know are the facts characterizing the
objective performances of the individual. What we want to know is the
nature of the abilities that lie behind the performances. We must infer
the latter from the former.


Various interpretations of the relationship between the performance
and the ability are possible. We can assume a one-to-one relationship
between the objective response and the postulated counterpart,
regardless of how we define it. In this case we can assign identically
the same number meanings to the ability as we are able to justify for
the performance. Another interpretation is to apply the number meanings
to the objective expression, but assume that the quantitative
relationship between the performance and the ability is less than
perfect and that, therefore, the number meanings apply only in a rough
way to the postulated ability. A third interpretation is to apply the
quantitative meanings to the performance and make no attempt to apply
any of them to the ability.

Which of these interpretations, or some other, will be championed in
the quantitative description of a given ability will depend upon several 
factors. Important among these are the amount of evidence favoring the 
existence of the postulated ability, the precision with which the objective 
performance can be quantitatively described, and the extent of the 
empirical evidence justifying the assignation of number meanings to the 
ability. The scientific psychologist cannot afford to remain strictly on an 
operational level and define all factors and relationships in terms of only 
the situation in which they are found. With this approach, psychological 
theorizing is severely restricted, a situation that science can ill afford 
to tolerate. On the other hand, error can be committed in assigning a 
reality to the postulated ability before the facts justify it. In this instance, 
the scientist, in terms of certain performance measures, may interpret 
the ability as a fact when in reality it is not. He may then make predic- 
tions about the abilities of individuals that lead to errors of serious 
consequence. 

PART TWO


Steps of the Scientific Method 


The scientific method does not consist of a single set of steps followed 
in some invariable chronological order. To a large extent, the chronology 
of a scientific experiment is tied to the ups and downs of the scientist’s
motivations. At some time in the early training of a student of science,
however, there is need for a systematic treatment of those steps that are 
fundamental to any scientific attack upon a problem. In presenting such a 
treatment, no claim should be made that the steps are well delineated 
one from another or that the order of the steps is established regardless 
of the nature of the scientist’s progress. The chronology of the steps given 
herein is to be interpreted as a serviceable one and as a frequently used 
one, but not as an invariable one. 

The definition and delimitation of a scientific problem are discussed in 
Chap. 7. It is accepted as a fact that we can get a solution faster and the 
solution is likely to be a better one if we understand all of the elements 
and implications of the problem to be attacked. The time spent in the 
early phases of a project learning about the nature of the variables in- 
volved is amply repaid in later stages in the form of a more precise and 
dependable experimental design. 

A very frequently used procedure of the scientist is to set up his
problem in the framework of a hypothesis. The value of this approach is
pointed out in Chap. 8. A systematic account is given of the steps used
in deductively elaborating a hypothesis and in evolving theorems and
testing situations for collecting facts about the hypothesis. In
systematizing these steps, fairly precise definitions are given to the
concepts presented. It is not intended that the treatment reflect the
variety of interpretations given these concepts by other scientists.
The object is to give the reader a clear exposition of the logical
processes involved so he can see both the means and the ends of
hypothesis formulation. Once the logic is understood, the reader should
have no difficulty deciding upon the meanings he personally wishes to
assign the concepts.

When the problem has been formulated, the next step is to collect facts 
that are pertinent to the hypothesis. Chapter 9 describes how the scientist
must devise an empirical testing situation in which the variables he 
wishes to study will be allowed to operate, freed from the influence of 
other interfering variables. The problem of control is a central one, as is 
also the problem of measurement. These problems are difficult to solve, 
especially when global types of behavior are under study. It must be 
recognized, however, that it is only through using adequate control and 
measurement procedures that trustworthy evidence can be obtained. 

The full import of the facts collected on a problem cannot be obtained 
from a cursory examination of the data. Chapter 10 presents some of the 
common problems faced by the scientific psychologist in the organiza- 
tion, analysis, and interpretation of his facts. The facts must be ordered 
in a way that will give answers to the questions that initiated the study. 
Certain characteristics of human behavior, such as the mean and vari- 
ability of performance, should be quantitatively analyzed and evaluated. 
When several variables have operated in an experiment, the nature and 
amounts of the relationships existing among them should be stated 
quantitatively. 

The final step taken by the scientist is to formulate generalizations 
about the meanings that are revealed in the data. As described in Chap. 
11, this is a hazardous and difficult task. It is the step wherein the scientist 
points out the value that his results may have. He suggests how his find- 
ings contain the solutions to the specific problems he is studying. He also 
projects his findings into yet uncharted areas and suggests implications
that they have for yet unsolved problems. The rational deductions he 
makes must be based on sound logical relationships between his own 
findings and the elements and characteristics of the situations to which 
he refers his generalizations. 

CHAPTER 7


The Definition and Delimitation 
of a Scientific Problem 


What starts the scientist on his quest? How do the problems arise which 
later he so intently studies? What does he do in order to discover the 
potential solutions that he undertakes to investigate? These are questions 
that long have troubled the scientifically untrained person and the
scientifically immature individual who has yet to grapple at close hand
with an original problem.


THE CENTRAL POSITION OF THE PROBLEMATIC SITUATION 


Inquiry begins when some past belief is questioned, when a familiar
solution fails, or when we do not understand some fact. The resulting
uncertainty leads us to a consideration of the underlying factors and
conditions and eventually to the recognition of a problem. Analysis of
the problem then leads us to trial hypotheses and possible solutions.

An Inquiry Starts from a Problem. At the beginning, every scientific
investigation arises from some problematic situation. The mere
collecting of facts, regardless of how precise the procedures and
techniques that are utilized, does not constitute a scientific
investigation. It is idle to collect facts unless they are referred to
some problem or question.

Sometimes a student of science becomes enamored of a method or tech- 
nique and then looks about for a problematic situation in which to use 
it. For the most part this approach is unproductive except as it serves to 
develop the student’s skill in the techniques employed. The primary
concern is to define a problem, not to look for a method. So much is known
about methodology that the solution of many new problems seldom 
demands a radical change in method. Even research findings of “world- 
shaking” proportions have arisen from only slight alterations in already 
well-established and familiar methods. 

One of our first tasks is to develop an accurate and comprehensive
description of our problem. We shall find this task much more difficult
than we at first suppose. It is not easy to put our finger on the particular
crucial points that are giving rise to our doubt and uncertainty. In fact, 
it is a mark of scientific genius to discover immediately the key sources 
of a problem and to know at what points to apply scientific procedures 
to discover a satisfactory solution. 

The Problem Sets the Direction of the Study. The initiation and direc- 
tion of all phases of an investigation are influenced by the problem. As 
already mentioned, a scientific investigation is not aimless fact gathering. 
It is directed toward the attainment of specific aims, and the facts which 
are collected gain significance in direct proportion to their contribution 
in attaining these specific aims. To understand the problem means to 
learn the location of the key points of difficulty. Knowing these points 
enables us to direct our efforts insightfully. We avoid running after in- 
triguing but false notions. We are better able to relate the subproblems in 
the most meaningful relationships. We know which phases merit first con- 
sideration. If we know the nature of the problem we know what to look 
for, where to look for it, and when it can be expected to occur. 

The Problem Reveals Methods and Procedures. The nature of the 
problem directly conditions the methods. Frequently, the problem implies 
or designates the particular methods to be used. Different types of prob- 
lems raise different types of questions requiring different types of answers. 
These answers must be discovered through the use of different types of 
procedures. The kinds of answers needed, then, determine the approach 
we make to the problem. As soon as we have gained an adequate knowl- 
edge of the problem, we can expect to discover some kind of method 
for attacking it. 

Occasionally a scientist gets so wrapped up in a particular procedure 
that he uses it on nearly every type of problem. His conviction is so strong 
that he will not wait for the analysis of the problem to give him needed 
knowledge concerning the nature of the most appropriate procedures. In 
his cocksureness he modifies the problem to adjust it to the procedure, 
rather than modifying the procedure to adjust it to the problem. When 
he has completed his study he may, no doubt, have arrived at a solution, 
but certainly the solution will not be for the problem that originally stimu- 
lated his interests. Knowledge of the problem not only precedes the 
application of methods; it determines what methods are appropriate. 

Knowledge about the Problem Aids in the Control of Bias. A thorough 
knowledge of the problem lessens the opportunity for the introduction 
of the biases of the investigator. Every investigation will reflect the quali- 
fications and points of view of the investigator because he directly or in- 
directly enters into every phase of the study. There are, then, many 
opportunities for the entrance of bias. The initial phases of a study are 
particularly likely to reflect the peculiar notions of the investigator. Here
he first encounters the unknowns with which he must deal, and he will
employ every conceivable scientific “trick” at his command to understand 
arm is likely to be a little less critical at this stage, when the “going 

A thorough analysis of the problem will reveal the many ramifications 
that it has and will enable the scientist to track down its many implica- 
tions. Such an analysis will tend to point up neglected phases and to 
reveal any twists resulting from personal bias. There is no guarantee 
against the introduction of bias into a scientific investigation, but a
comprehensive knowledge of the problem will greatly reduce the likelihood
of its occurrence.


EVOLVING A CONCEPTUAL FRAMEWORK FOR THE PROBLEM 


Our knowledge about any problem can be roughly divided into what are
called facts and what are called explanations. The former are
empirically substantiated. The latter are the best guesses and reasons
we devise to enlarge our understanding of the problem. It is quite
impossible to separate the two in our thinking while we are attacking a
given problem. For purposes of discussion, however, we shall want to
treat them as if they were so separated.

The steps involved in framing a problem for scientific investigation
will be more easily understood if we parallel our discussion with the
description of an actual problem. Suppose we select a problem in
highway-accident prevention, an area in which most everyone will have
some background knowledge. To begin with, we should start with a very
vague and general notion of the problem, that is, we should make
believe that we are as naive as we would be with a problem about which
we knew very little. As each step is presented we can then develop the
problem little by little toward a more specific and workable problem
for investigation. Let us start with the general question: How can
automobile accidents be reduced?

The Meaning of a Conceptual Framework. The evolution of a conceptual
framework for a problem means organizing our knowledge into a
meaningful set of relations which will enable us to get a clear
perspective of the variables at work and which will reveal modes of
attack for collecting additional information. In gaining an
understanding of the nature of a problem we study it from every
possible angle. We collect both facts and explanations and try to piece
them together into some kind of unitary picture that will encompass all
phases of the problem. We seek out relations among the facts, relations
among the explanations, and relations between the facts and the
explanations. In this way we extract as much meaning as we can from our
present knowledge, and using this as a base we then evolve possible
solutions to the problem which can be tested under empirical
conditions.

The Listing of all Possible Constituents. The Problem. Our first step 
is to list all possible constituents of the problem. A constituent is any 
item of information, either factual or theoretical, that we consider relevant 
to the problem. For most problems with which we would be concerned 
we would have some items of information. We would know about some of 
the factors, elements, aspects, characteristics, conditions, etc., that
contribute to it. At this point we should not be too ready to discard items
of information that seem to have little meaning for our problem, for we 
are not prepared to make sound judgments about the pertinence of the 
relationships involved. We should include the questionable items and 
make the listing of constituents as complete as possible. 

Our list should include all explanatory postulates that in any way 
bear on the problem. The phrase “explanatory postulates” sounds some- 
what formidable, and may cause us to wonder why at this early stage—a 
stage bordering on complete ignorance—we should attempt to deal with 
elusive conceptions instead of concrete down-to-earth facts. The logic 
of justifying this course of action is simple. What is needed for solving 
any problem is a solution. At the beginning, a solution consists primarily 
of an explanation. If we discover the explanation of a problem, we usually 
solve the problem. The elements or relationships that we conceive as a 
solution of the problem are then to be listed as constituents. 

Even though the task is a difficult one, it cannot be overemphasized 
that we must thoroughly examine the theoretical constituents of the 
problem. As we learned earlier, explanation projects beyond the known 
into the unknown. Explanatory postulates serve as bridges by which we 
cross from the known constituents to the unknown solutions. Through 
them we are able to set up possible solutions that can be tested em- 
pirically. 

An Illustration. Let us list some of the constituents of the problem of 
how to prevent automobile accidents. Remember these are ideas that 
come to mind as we think about the problem, ideas that we think some- 
how belong to the problem. Let us phrase them just as they come to mind. 


The degree of traffic congestion.
The time of day of accidents.
Glare blindness.
The condition of the roadway.
People who drink should have their driver's license permanently revoked.
Divided highways.
Make of car.
Poor eyesight causes most automobile accidents.
Most drivers are accident prone.
Violating the right of way.
Reckless drivers cannot be improved.
Good roads will eliminate accidents.
Sex and age of the driver.
Type of accident.
Highway markers.


Some of these constituents concern situational factors, others concern 
personal or driver factors. Some are specific in nature, others are rather 
general. Some are factual and concrete, others are explanatory and ab- 
stract. The interrelations among some are close, among others remote. 
Obviously, with further thought many more constituents could be listed. 


We should note that some of the explanatory postulations suggest
determiners to be investigated. For example, visual acuity as a
determiner of automobile accidents can be developed readily from the
rather extreme statement that poor eyesight causes most accidents.
Here, then, is a bridge from a known theoretical constituent on the one
hand to an unknown but possible solution on the other. Furthermore,
this suggested functional relationship is one that can be studied
empirically.

In connection with this same suggestion, the reader is invited to
demonstrate for himself how the nature of the problem contributes to the
determination of the method. No mention will be made here of a possible
method to study visual acuity in relation to accidents in order that the 
reader can devise one by himself and in so doing become aware of how 
the understanding of a problem readily leads to a possible method for
its investigation.

Determining the Theoretical Security of the Constituents. The Problem.
Having listed as many constituents as we can, the next step is to
examine the theoretical basis for each constituent. The expression
theoretically secured means that there is an acceptable explanation indicating
that the constituent bears a relation to the problem. We assume that there 
is a theoretical structure (explanation) for each constituent. We must 
discover it, trace it, and describe it. We do this by determining the nature 
of the relationship existing between the constituent and all other constit- 
uents, both factual and explanatory. We should not be concerned at this 
point if we find no obvious relation between a given known and theo- 
retically secured constituent and a constituent that is only guessed and is 
theoretically unsecured. Although we should be on the lookout for such 
relationships, we should not despair if they are not observable at this 
early stage of our analysis. 


Many of the constituents can be placed in one of three general classes:
(1) constituents that are determinate and theoretically secured, (2)
constituents that are indeterminate but theoretically secured, and (3)
constituents that are indeterminate and not theoretically secured. By
determinate is meant that facts are available to support the contention that the
constituent can be examined empirically. In our illustration, the make 
of the car, the time of day, and the condition of the roadway are examples 
of determinate constituents. Indeterminate refers to the lack of evidence 
to justify at the time the belief that the constituent can be examined em- 
pirically. An example is the constituent of accident proneness. This con- 
stituent has yet to be sufficiently clearly defined to be subjected to em- 
pirical testing. 
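
The three classes can be sketched as a small bookkeeping routine in
Python. This is only an illustration under stated assumptions: the type
Constituent, the function constituent_class, and the true/false ratings
given to each constituent are hypothetical. In practice the ratings are
judgments the investigator must make and later revise; they are not
values supplied by the text.

```python
from dataclasses import dataclass
from typing import Optional

@dataclass(frozen=True)
class Constituent:
    name: str
    determinate: bool            # facts indicate it can be examined empirically
    theoretically_secured: bool  # an acceptable explanation ties it to the problem

def constituent_class(c: Constituent) -> Optional[int]:
    """Return 1, 2, or 3 for the three general classes, else None."""
    if c.determinate and c.theoretically_secured:
        return 1
    if not c.determinate and c.theoretically_secured:
        return 2
    if not c.determinate and not c.theoretically_secured:
        return 3
    return None  # determinate but unsecured falls outside the three classes

# Ratings below are illustrative assumptions, open to revision.
constituents = [
    Constituent("time of day", True, True),
    Constituent("condition of the roadway", True, True),
    Constituent("accident proneness", False, False),
]

classes = {c.name: constituent_class(c) for c in constituents}
for name, cls in classes.items():
    print(name, "->", cls)
```

Returning None for the remaining combination reflects the wording that
many, not all, of the constituents fit one of the three classes.
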

Some of the constituents will be difficult to classify and evaluate at 
the beginning of our analysis. Their theoretical significance may be un- 
known and revealed only after much further study. Also, with additional 
study we may have to revise some of our evaluations. A constituent might 
be judged to be secured theoretically, but after further empirical evidence 
is obtained it might be demonstrated that the factor has little significance 
for our problem. 

An Illustration. Let us consider the theoretical security of two of the 
accident constituents. First let us examine the factor of time of day. 
What we want to do is to show logically that time of day might have some 
pertinent relation to automobile accidents. There is considerable evidence 
that the effectiveness of man’s responses varies with the hour of the day. 
Studies made with factory workers doing various psychomotor tasks 
amply justify this conclusion. It would be expected that the psychomotor 
task of driving an automobile would be subject to similar diurnal fluctua- 
tions. If this is so, then the driver's effectiveness would vary and he would 
be more likely to have accidents at some hours than at others. We can 
engage in a little naturalistic observation here and suggest that the 
driver is not wide-awake early in the morning and is tired at the end of 
the working day, so his effectiveness at the wheel might be less at these 
hours. We can, then, probably safely assume that the time of day is
related to accidents and deserves investigation.

A little more thought might lead us to think of density of traffic as 
also varying at different hours of the day. There is the morning rush to 
work and the evening rush to return home. We now have the time-of-day 
constituent coming into relation with the constituent of degree of traffic 
congestion. Further consideration might lead us to suspect that time of 
day may not be as important as we at first thought, that what we are 
calling time of day is not an accident determiner at all but merely an 
objective sign that helps us to discover and localize factors about the 
driver (sleepiness and tiredness) and factors about the roadway (con- 
gestion) that merit further study. 


Accident proneness can be suggested as another concept frequently 
used in explaining accidents. It should be said, however, that the concept
has been applied uncritically as a determiner of accidents. Presumably, 
according to this concept, if we could discover the accident-prone drivers 
and rule them off the highways, we would greatly reduce the number of 
accidents. It might come as a surprise to the reader to learn that accident 
proneness has not yet been shown empirically to be a useful concept in 
the prevention of automobile accidents. The concept is not as well se- 
cured theoretically as its proponents lead us to believe. 

Forming a Theoretical Framework for the Problem. The Functions 
Served by a Theoretical Framework. At this point in our study we are in 
need of discovering some unifying principles that will bring all the 
factors into an integrated whole. We have presumably completed a list- 
ing of all of the factual and explanatory constituents we can discover. 
The picture is somewhat chaotic because there are so many individual 
and apparently unrelated factors. We need to formulate a theoretical
framework.

The task is to go as far as we can to round out and strengthen the
explanation of all constituents, and particularly to strengthen the
weakly secured constituents and to discover explanations for the
unsecured ones. It should be obvious that there will be differences in
the degree of effectiveness of the explanations of the various
constituents. Not all theory is equally sound. We find ourselves
confronted with theoretically secured constituents, weakly secured
constituents, and theoretically unsecured constituents. These latter
may turn out to hold key positions in the eventual solution of our
problem. Regardless of the theoretical security of the individual
constituents, we must tie all constituents together into a theoretical
system of interrelated structures, paying particular attention to the
theoretical trends that extend from the center of the determinate and
theoretically secured elements to the periphery of the indeterminate
and unsecured elements.

The formation of an over-all theoretical framework gives us a very
essential theoretical perspective, a perspective of the problem in its
larger aspects. No longer will the constituents appear as unrelated
elements. We shall see that the problem has ramifications in a variety
of directions and thus can be studied from a variety of angles. The
weakly secured and unsecured constituents are accentuated and brought
into relief by their relations or lack of relations with the
theoretically secured constituents. Within our framework we can realign,
shift, reinterpret, and manipulate in many ways the meanings and
emphases by which tentative solutions can be evolved. It should be
remembered that a theory must be discovered that will encompass the
unexplained constituents, and in all likelihood this will be
accomplished by extending to them the explanations of the determinate
elements.




With this theoretical framework we can better determine the main di- 
mensions of our problem. Up until now we have not been able to dis- 
tinguish the heart of our problem from any other part of its anatomy. 
With a theoretical framework we can detect the more important determi- 
nate variables. We shall then be less likely to overlook the factors that 
hold the key to the correct solution. We can better detect false, though 
intriguing, leads and shall be less likely to proceed in directions tangential 
to the main issues of the problem. 

Another contribution of our framework of theory is that it furnishes 
a solid base of operations for attack upon the unknown factors. It 
furnishes bridges of theory by which we can go from the determinate 
elements to the indeterminate elements—from the known to the unknown. 
This is the crux of our task. When knowledge is lacking, explanation 
through theory must fill the gap. For our problem, which at this point is 
largely unknown, explanation through theory is not only necessary, it is the 
only way open to us. In tracing the nexus of relations existing between 
the constituents of our problem and in attempting to explain the unse- 
cured constituents, we have an opportunity of bridging the gap between 
the known and the unknown. In the theoretical perspective of the totality 
of constituents the position occupied by the theoretically unsecured con- 
stituents offers clues by which potential explanation of these constituents 
can be evolved. Tracing out the theoretical relationships among the con- 
stituents is, then, a powerful lever for breaking loose “chunks of the 
unknown.”

An Illustration. Let us make a beginning toward formulating a theoreti- 
cal framework of our accident-prevention problem. In our brief examina- 
tion of the constituents we saw that some were more closely related to 
our problem than were others. We also saw wide variation in the inter- 
relations among the constituents. Even with such a brief examination 
there began to emerge some broad characteristics that might lead us to 
an integrated picture of the constituents. We observed that some of the 
constituents were to be found in the driver, e.g., visual acuity, sleepiness, 
accident proneness. Other constituents were characteristic of the driving 
situation, e.g., condition of the road, divided highways, highway mark- 
ers. Probably most accident constituents could be placed in these two 
categories—the personal and the situational. 

Let us further examine the constituents for other general character- 
istics. Condition of the highway breaks down into many more specific 
factors like slipperiness due to snow and rain, rock slides, hairpin turns, 
blind intersections, narrow and soft shoulders, etc. If we examine these 
factors more closely, we find that they vary in their permanence. Rock 
slides are usually temporary or transitory hazards. Rain and snow are 
variable in their duration, depending upon geographic location and season
of the year. Hairpin turns and blind intersections are likely to be
more permanent hazards. Situational factors can then be described in 
terms of their degree of permanence. 

Can we now find this character of degree of permanence in the personal 
constituents? Fatigue and tiredness are probably to be classed as transi- 
tory. Depending upon the nature of the disorder, eye defects vary from 
very transitory, such as a little dust in the eyes, to permanent defects, 
such as a cataract. Some motor defects of the driver are transitory, as a 
sprained ankle; others are permanent, as loss of an arm. 

We now have the beginning of a theoretical framework. Constituents can be
classified as either personal or situational, and under each category they
can be further classified as transitory, semipermanent, or permanent. Only
by considerably more analysis can we learn whether or not this particular
framework will be a productive one.
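
The two-way classification just described can be written down as a small data structure. The following is only a sketch: the class and function names are our own inventions for illustration, and each placement simply follows the examples given in the text. Listing the empty cells is one way to see where the framework still has voids.

```python
from collections import defaultdict
from dataclasses import dataclass
from enum import Enum


class Category(Enum):
    PERSONAL = "personal"
    SITUATIONAL = "situational"


class Permanence(Enum):
    TRANSITORY = "transitory"
    SEMIPERMANENT = "semipermanent"
    PERMANENT = "permanent"


@dataclass(frozen=True)
class Constituent:
    name: str
    category: Category
    permanence: Permanence


# Placements follow the examples in the text; the names are illustrative.
CONSTITUENTS = [
    Constituent("rock slide", Category.SITUATIONAL, Permanence.TRANSITORY),
    Constituent("hairpin turn", Category.SITUATIONAL, Permanence.PERMANENT),
    Constituent("blind intersection", Category.SITUATIONAL, Permanence.PERMANENT),
    Constituent("fatigue", Category.PERSONAL, Permanence.TRANSITORY),
    Constituent("dust in the eyes", Category.PERSONAL, Permanence.TRANSITORY),
    Constituent("cataract", Category.PERSONAL, Permanence.PERMANENT),
    Constituent("sprained ankle", Category.PERSONAL, Permanence.TRANSITORY),
    Constituent("loss of an arm", Category.PERSONAL, Permanence.PERMANENT),
]


def build_framework(constituents):
    """Group constituent names by their (category, permanence) cell."""
    cells = defaultdict(list)
    for c in constituents:
        cells[(c.category, c.permanence)].append(c.name)
    return dict(cells)


def empty_cells(framework):
    """Return the cells of the 2 x 3 grid that hold no constituent yet."""
    return [
        (cat, perm)
        for cat in Category
        for perm in Permanence
        if (cat, perm) not in framework
    ]


framework = build_framework(CONSTITUENTS)
voids = empty_cells(framework)
```

With only the examples mentioned so far, both semipermanent cells remain empty, which suggests one direction in which further analysis might look for constituents.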

The task of formulating a theoretical framework for the accident-prevention
problem should be interpreted as a continuous one. Further study will
reveal other dimensions along which meaningful relations between
constituents can be developed. Theory will expand and become more and more
intricate as we continue to study the problem. From time to time within the
total framework definitive explanatory postulates will crystallize. These
will merit further analysis and eventual empirical testing.

Discovering New Constituents. Procedures to Use. In tracing the
theoretical structure of a constituent, we shall eventually find ourselves
being concerned with new constituents and their theoretical security. The
discovery of new constituents necessarily follows from the essential
nature of phenomena. As we found in earlier chapters, natural events, as
we know them, have the characteristics of orderliness and
interrelatedness. We shall find this orderliness and interrelatedness
among the constituents as we continue our search.

At this stage the picture is not complete; it contains some voids. We are
only beginning to create a meaningful picture of all constituents known to
us. In continuing we must locate the indeterminate factors in the picture
as best we can. We must try to discover new constituents in the voids and
then develop satisfactory explanations for them. The framework affords us
a means of moving in theory towards these voids, extending and elaborating
known theory toward unknown constituents. Here we are treading the narrow
line between the known and the unknown. With the filling of each void we
achieve a step toward the solution of our problem.

There are several little tricks we can use in hunting for new
constituents. One procedure is to work on the assumption that points whose
explanation seems obvious and complete often need further study. Another
idea to explore is that commonplace elements, which by snap
judgment are considered irrelevant, may contain some significant mean- 
ings. Ideas that we may have tried and discarded in times past may 
deserve to be “resurrected” and given another examination. Much too 
frequently, as beginning scientists, we overlook the obvious elements 
because we have the notion that complex problems must have compli- 
cated solutions. Sometimes we can expect to find our solution in the 
simple relationships to be found among factors that are already familiar 
to us. 

An Illustration. Referring to the problem of accident prevention let us 
note how new constituents can be developed. Although the ideas to be 
presented are not now being developed for the first time, some process 
akin to the associations to be mentioned occurs when we begin to trace 
out factors in the development of a new problem. Let us state the over- 
all theory we developed: Accidents are caused by personal and situational 
factors, some of which are transitory while others are semipermanent or 
permanent in nature. 

When we considered the factor of time of day in our analysis, we were 
led to think about two other factors: sleepiness and fatigue. We also 
mentioned the characteristic of poor eyesight, which led to the constitu- 
ent of visual acuity. We could have raised the further question of whether 
other eye deficiencies might condition accidents, for example, color blind- 
ness, side vision, or vertical and horizontal imbalance. 

Inasmuch as one of the general categories is that of personal factors, 
we could comprehensively examine all manner of human responses that 
are related to driving behavior. As a result we might think of diseases, 
such as epilepsy; of habits, such as starting late for appointments; of atti- 
tudes, such as the obsession of having to pass all other cars; of the prac- 
tice of driving after drinking alcoholic beverages, etc. 

One of the constituents we listed was the type of the accident. This 
would be a fruitful lead to other factors. Study of head-on collisions 
might lead to the factor of three-lane highways and to the need for laws 
to regulate improper passing. Study of intersectional accidents might lead 
to an examination of street lights and signs, of one-way streets, of laws 
governing parking along main arteries during peak periods. 

It is apparent that we should take advantage of any association be- 
tween constituents that will lead us to new ideas. At this stage we should 
not be overly critical of the relevance of these ideas but rather adopt the 
philosophy that the more constituents we can discover, the greater the
chances will be that we shall hit upon constituents that will lead us to a
solution of the problem.


TESTING THE THEORETICAL FRAMEWORK AGAINST FACTS


In evolving a theoretical framework for our problem, we made use of 
any facts that came to mind. We did not systematically review the 
known facts in support of our constituents. Now, with the conceptual
meanings of our theoretical framework before us, we can collect our
facts more intelligently because our theories will suggest the directions
we should take in our excursions into facts.

The General Nature of the Factual Analysis. Current facts are to be 
brought to bear on the entire theoretical system. Seldom will adequate 
explanations be derived from a study restricted to the theoretical phases 
of a problem. The exposure of known theory to factual tests usually 
results in the improvement and extension of the theory. The factual study 
now being suggested revolves around the theoretical framework with 
the intention of correcting its deficiencies and extending its coverage.

This factual study is not intended to be an intensive empirical testing 
of any specific postulate. Actually we have not yet evolved a particular 
problem. We are still working with the over-all picture, which includes 
all of the constituents. We are not ready to delimit some specific explana- 
tory postulate for intensive study. Inasmuch as our main objective is to 
understand the over-all problem, the factual study is oriented toward 
this objective, which means, then, that theory and explanation are central
to the factual analysis.

In our accident-prevention problem the theoretical framework is barely
begun. It lacks coverage and detail. We are certainly not ready to plan
any experimental investigation. There is a world of facts available that
we must incorporate in our system. These facts will help strengthen the
weak spots and will furnish support for many of the interrelations that
we perceive developing in our framework of theory.

Starting Factual Analyses with Simple Situations. In beginning our
factual study, we should start with simple manifestations of our
constituents. In our state of relative ignorance we would do well to begin
our analysis with simple constituents and leave the more involved ones for
later examination. The implications of the simpler elements will be easier
to trace than the implications of the more complex ones. This does not
mean that we should neglect the complex ones; we postpone their
consideration in the hope that study of the simpler elements will give us
a better preparation for tackling the more complex ones. In our accident
problem, the constituents of “reckless ... be changed,” “people who drink
should have their driver's licenses permanently revoked,” “most drivers
are accident prone,” are constituents that should prove very difficult to
analyze. “Type of accident,” “time of
day of accident,” “sex and age of driver,” are constituents that should 
prove less difficult of analysis. 

Increasing the Theoretical Security of Constituents. The Problem. One 
step to keep continually in mind during the factual analysis is to relate 
the facts to the explanations of our theoretical framework. In reference 
to theory, facts can be utilized in two ways; namely, facts can be used 
in the discovery of theory, or facts can be used in the testing of theory 
already discovered. We are suggesting the use of facts in both ways, but 
we are primarily concerned with the discovery of theory. The theoretical 
framework that we devise for any problem must be filled in, its weak 
points strengthened, its boundaries enlarged. When we have completed 
this task, we are then on the way to evolve a specific problem. The 
theory underlying the specific problem will then need to be subjected to 
an empirical test. 

By exposing the theoretically secured constituents to factual study 
we not only obtain further evidence of their validity but we widen our 
knowledge of the theoretical relations that these constituents have to 
others. As we noted earlier, in a theoretical analysis of a problem con- 
stituents will vary in their theoretical security, some being adequately 
explained, others having no satisfactory explanation. We will gain greater 
precision in our explanation of weakly secured constituents if they are 
brought into theoretical proximity with those that are adequately secured. 
We may be fortunate in finding factual situations in which secured and 
unsecured constituents are present and interacting. Such situations should 
be highly prized and they should be carefully examined for all charac- 
teristics and interrelationships. If there is linkage between the theoreti- 
cally secured and unsecured elements, we may immediately gain an im- 
portant leverage over the explanation of the unsecured constituents. 

An Illustration. In our accident problem, we suggested poor visual 
acuity as a probable determiner of accidents. We should seek facts on 
this factor. We are certain that blind people should not drive—although 
occasionally one is found driving, depending upon another person to 
direct his driving activities. We ought to find out what level of acuity is 
necessary for driving. Is it necessary to have the same keenness of vision 
for driving a car as is required for reading a book? Is driving a problem 
of near vision or of far vision? Can a person have good near vision and 
poor far vision? Is the opposite of this possible? If a driver cannot see
very effectively without glasses, does the wearing of glasses, which gives 
him normal vision, satisfy the visual-acuity demands of safe driving? 
These are questions that point up some of the aspects of visual acuity for 
driving. They are questions about which facts can be collected. There 
may be empirical studies available that will enable us to evaluate better
the place of this constituent in our theoretical system. 

Analyzing Indeterminate Constituents. The Problem. Factors may be 
classified as indeterminate constituents because we have insufficient 
knowledge about empirical situations in which they might be manifest. 
An examination of the available facts may reveal ways by which a for- 
merly indeterminate constituent is seen as determinate. Factual studies 
should increase our knowledge of situations where an indeterminate 
constituent might be found or of situations where other constituents
possibly related to it are manifest. As stated before, it is through
investigating the possible relationships of a constituent with other
constituents that its determinateness and theoretical security are
revealed.

An Illustration. The concept of accident proneness is one of the least
understood of any of the variables suggested as determiners of accidents.
The question arises as to whether or not it is really a determinate
constituent. When we inquire into the meaning of accident proneness, the
most frequent answer we receive is that it refers to proneness to have 
accidents. This is a very unsatisfactory definition; it is defining a concept 
by merely repeating the term itself. Of course, statistics are presented 
to show that a large percentage of accidents can be attributed to a small 
percentage of drivers. But when such questions are asked as: What is it 
that makes the driver accident prone? Is it his hearing? or his vision? or 
his motor coordination? there are no replies. 

What is needed is a search for facts about accident proneness, how- 
ever it is defined. Before we can attribute accidents to some enduring 
process or deficiency of an individual, we must show that the so-called 
accident-prone drivers continue to have high accident records for a con- 
siderable length of time. One factual source of information might be the 
records of the frequency of accidents of a group of individuals which 
extend over a fairly long period of time. We could determine if the 
accident rate remains approximately constant for a certain proportion of 
the drivers. It would be interesting to know if those drivers who are 
known at a given time to be responsible for a large proportion of acci- 
dents continue month after month and year after year to remain in the 
high accident group. If they do, this fact would point to the possibility
of some enduring process or deficiency of the driver. It would not tell
us what the condition of the driver was, and we would still need to seek
additional facts to learn the exact nature of this enduring condition
which masquerades under the term accident proneness.
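
The check on accident records described above can be sketched in a few lines. This is a rough illustration only: the function names, the 20 per cent cutoff, and the accident counts are all hypothetical and are not drawn from any actual records. The idea is simply to ask what share of the drivers in the high-accident group of one period are found in the high-accident group of a later period.

```python
def high_accident_group(counts, fraction=0.2):
    """Return the drivers making up the top `fraction` by accident count."""
    size = max(1, round(len(counts) * fraction))
    # Sort by descending count; ties are broken by driver label.
    ranked = sorted(counts, key=lambda driver: (-counts[driver], driver))
    return set(ranked[:size])


def persistence(earlier, later, fraction=0.2):
    """Share of the earlier high-accident group still in the later one."""
    first = high_accident_group(earlier, fraction)
    second = high_accident_group(later, fraction)
    return len(first & second) / len(first)


# Hypothetical accident counts per driver for two successive periods.
period_1 = {"A": 4, "B": 3, "C": 0, "D": 1, "E": 0,
            "F": 0, "G": 1, "H": 0, "I": 0, "J": 0}
period_2 = {"A": 3, "B": 0, "C": 0, "D": 2, "E": 0,
            "F": 1, "G": 0, "H": 0, "I": 0, "J": 0}

share = persistence(period_1, period_2)
```

A share near one over many periods would point toward some enduring condition of the driver; a share near what chance alone would give would argue against it. Neither result would tell us what the condition is.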

Discovering Unknown Constituents. The Problem. In the theoretical
framework of all problems there is the need of discovering additional
constituents. There are unknowns to explore. We are not always certain
of where the gaps in our knowledge are, and do not always understand
the nature of these gaps. The more we can use the theoretical framework 
to estimate what the probable qualities and characteristics of the un- 
known constituents are, the sooner we shall be able to find factual situa- 
tions in which we might discover them. The interrelations worked out 
in our theoretical system provide much information about unknown 
constituents, but this information needs to be checked against fact. 
Exposing our ideas about the projected unknown constituents in factual 
situations where determinate constituents are operating often results in 
improving our understanding of the unknowns. It provides new implica- 
tions for theory that apply to the unknown constituents. At all times in 
our analysis, this building of theoretical bridges to the unknown con- 
stituents must be kept constantly in our thinking. 

An Illustration. We have listed glare blindness and traffic congestion 
as two constituents in the accident picture. Statistical studies indicate 
that accidents occur more frequently at night than is warranted by the 
amount of night driving. Facts also indicate that although the prepon- 
derance of traffic is in the urban areas, about half the accidents occur 
on the open highways. We do not have an adequate theoretical frame- 
work to understand fully these two factual results. There are probably
several unknown constituents in each situation, and these should be 
objects of discovery. Glare blindness provides us a start on night acci- 
dents, but it is wholly inadequate to explain the high frequency of these 
accidents. Traffic congestion is readily accepted as a very important 
source of accidents, the thought being that high accident rates are to be 
found where congested traffic conditions are prevalent. This determiner 
must be reexamined in the light of the fact that about half the accidents 
occur in noncongested areas. The study of night accidents and of open- 
road accidents should be further pursued in order to discover any addi- 
tional constituents that may underlie the high proportion of night acci- 
dents on the open highway. 

Three Standard-type Situations to Look for in the Factual Analysis. 
The scientist has developed several ways of organizing the variables of 
his problem in order to get the best revelation of the relationships he 
wishes to investigate. Although these organizations characterize the
experimental designs of laboratory research, arrangements somewhat similar
to them can be found in nonlaboratory situations. Three types of
arrangements are presented in the following discussions.


Factual Situations with and without a Given Constituent. A standard 
experimental situation often used by the psychologist involves two groups 
of subjects. One group is exposed to the experimental variable, the other 
is not. In all respects but this one the two groups are supposed to be 
equivalent. If positive results are obtained from the experiment, they are
then attributed to the factor in which the two groups differed. 

In searching for facts about our constituents we should be alerted to 
those accident situations that are comparable in all respects related to 
accidents except in reference to the presence of the constituent that is 
to be examined. There should be situations in which the constituent is 
operative and others in which it is not found. Such comparable situations, 
although not easy to find, offer a tremendous leverage for uncovering 
knowledge about a constituent because of the sharpness with which the 
constituent is brought into focus. 
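
The comparison of situations with and without a constituent can be reduced to a very small calculation. The sketch below is ours, not a prescribed procedure: the `Group` class, the `compare` function, and all the figures are hypothetical, and the calculation means something only if the two groups really are comparable in every respect related to accidents except the constituent under study.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Group:
    label: str
    drivers: int     # number of drivers observed
    accidents: int   # accidents recorded for those drivers

    @property
    def rate(self):
        """Accidents per driver over the observation period."""
        return self.accidents / self.drivers


def compare(with_constituent, without_constituent):
    """Return (rate difference, rate ratio) between the two groups."""
    diff = with_constituent.rate - without_constituent.rate
    ratio = with_constituent.rate / without_constituent.rate
    return diff, ratio


# Hypothetical figures, assumed to come from otherwise equivalent groups.
present = Group("constituent present", drivers=200, accidents=30)
absent = Group("constituent absent", drivers=200, accidents=20)

difference, ratio = compare(present, absent)
```

The arithmetic is trivial; the difficulty lies in finding groups that justify it. Any factor other than the constituent that differs between the groups will be mixed into the difference.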

A case in point in our accident problem can be suggested. Suppose 
that we are in search of information about the constituent of restricted 
side vision. The theory is that people whose vision is peripherally re- 
stricted will have more accidents than those without restriction, because 
the restriction will prevent them from always seeing cars coming in from 
the side. We can ask: In what practical situations would the handicap 
of restricted side vision be likely to interfere with safe driving? Cer- 
tainly cutting in on the highway would be one. Driving behavior at 
intersections would probably be an even better situation to study because
of the greater frequency with which safety problems arise in this
situation. The procedure would be to examine the accident statistics for
two groups of drivers, one group with restricted side vision and the other
with no restriction. Accidents at intersections would be selected in which
factors other than side vision restriction could be ruled out. Any
differences in the frequency of accidents between the two groups would
then be largely attributable to the factor of restricted side vision.

Factual Situations with a Constituent Varying in Amount. Another
standard experimental procedure in psychology requires that several
situations containing the experimental variable be created that are
equivalent in all relevant respects except in the amount of the expression
of the experimental variable. It is expected that if the experimental
variable is related to some end effect, this effect will vary in some
consistent way with the differences in amount of the experimental
variable. Suppose we wish to measure in a crude way the intensity level of
hearing of a group of individuals. We use electrical devices that will
enable us to keep the pitch of the tone constant while varying the
intensity through measurable levels. We ask the individuals in the group
to raise their hands when they hear the sound. A few hands will be raised
at the low intensities, and as the intensity is increased the number of
hands that are raised will increase. A functional relationship can be
found between the intensity of the sound and the number of individuals who
are able to hear it.

Certainly the accident constituent of speed of driving is associated
with frequency of accidents. We should be able to find accident situations
in which the rate of speed varied. Our principal problem in such a
factual study would be to achieve equivalence between the situations in
all relevant factors other than that of speed of driving. In situations in
which equivalence was achieved we could make an accurate analysis
of the relationship between speed of driving and frequency of accidents.
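
The hearing illustration shows how a functional relationship might be tabulated when a variable is present in varying amounts. The sketch below is only illustrative: the function names are our own, the intensity levels are arbitrary units, and the counts of raised hands are hypothetical. The same tabulation could be applied to bands of driving speed and accident frequency, provided the situations were equivalent in other relevant respects.

```python
def proportion_by_level(observations):
    """Map each level of the variable to the proportion showing the effect.

    `observations` maps a level to a (responders, total) pair.
    """
    return {
        level: responders / total
        for level, (responders, total) in sorted(observations.items())
    }


def is_consistently_increasing(proportions):
    """True if the effect never falls as the level of the variable rises."""
    values = [proportions[level] for level in sorted(proportions)]
    return all(a <= b for a, b in zip(values, values[1:]))


# Hypothetical hearing test: intensity level -> (hands raised, group size).
hearing = {10: (2, 40), 20: (9, 40), 30: (21, 40), 40: (34, 40), 50: (40, 40)}

curve = proportion_by_level(hearing)
consistent = is_consistently_increasing(curve)
```

A curve that rises steadily with the amount of the variable is the kind of consistent variation the procedure looks for; an irregular curve would send us back to ask whether the situations were truly equivalent.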

Factual Situations Involving Interaction. Interactions might be found 
under a variety of conditions. Interactions among different theoretically 
secured constituents might be found. Such situations would supply us 
with information concerning the relative importance of the several con- 
stituents. Constituents that are theoretically secured might be found 
interacting with weakly secured or unsecured constituents. Here we 
would find a means for extending our theories toward those constituents 
needing further explanation. Occasionally we might find highly complex 
forms of interaction among the constituents. In one situation an unsecured 
constituent might take on a very significant meaning on one occasion
because of its apparent relationship with a theoretically secured con- 
stituent, but on another and similar occasion this meaning might not 
apply. A search usually would reveal that the presence or absence of the 
particular meaning is a function of the nature of the interaction between 
the constituents. Such complex types of interrelations are not easily 
analyzed, but when found and understood they usually contribute a tre- 
mendous amount of information about the problem under study. 

In accident situations we encounter enormous complexity, interaction 
being the rule rather than the exception. For example, interaction occurs 
between the constituent of speed of driving and such other constituents as 
driver-reaction time, traffic congestion, ability to recover from glare 
stimulation, mechanical condition of the brakes, divided highways, slides 
on the highway, and many others. Obviously, in many accident situations 
several of these variables are interacting with one another. 

The problem of determining the relative importance of interacting 
accident variables is one of the most difficult tasks confronting us. One 
generalization can be expressed now. The relative importance for acci- 
dents of any given constituent is likely to be a function of the specific 
characteristics of the accident situation. Because of our limited knowl- 
edge we are not justified in indiscriminately applying a generalization 
about the importance of a certain constituent to situations in which
many other constituents are involved until we understand the effects of
their interactions.
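
A small numerical sketch may make the point about interaction concrete. The rates below are hypothetical, and the two functions are our own illustrative devices rather than an established procedure. They compute the effect of one constituent (speed) separately within each level of another (traffic), and then the difference between those effects.

```python
def simple_effects(rates, factor_levels, context_levels):
    """Effect of moving between the two factor levels within each context."""
    low, high = factor_levels
    return {
        context: rates[(high, context)] - rates[(low, context)]
        for context in context_levels
    }


def interaction_contrast(effects, context_levels):
    """Difference between the factor's effects in the two contexts."""
    first, second = context_levels
    return effects[second] - effects[first]


# Hypothetical accidents per 1,000 trips, keyed by (speed, traffic).
rates = {
    ("moderate", "light"): 2,
    ("fast", "light"): 4,
    ("moderate", "congested"): 3,
    ("fast", "congested"): 11,
}

effects = simple_effects(rates, ("moderate", "fast"), ("light", "congested"))
contrast = interaction_contrast(effects, ("light", "congested"))
```

In these made-up figures the effect of speed is 2 in light traffic and 8 in congested traffic. A contrast that is not zero means that no single statement about the importance of speed holds across both situations, which is the caution expressed above.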




THE DEVELOPMENT OF A SPECIFIC PROBLEM 


The General Evolution of a Problem. The evolution of a specific problem
proceeds from vague notions and hunches about a subject to a word picture
of a particular problem which has precise theoretical boundaries and is
amenable to empirical study. The translation (really transformation) from
vagueness and abstractness to preciseness and concreteness frequently runs
a protracted and uncertain course. One of the “facts of scientific life”
that we should learn early is that solutions to
significant problems do not come easily. 

The nature of the specific problem will be a function of our interests, 
ability, and background. Certain phases of the theoretical framework 
will have more appeal than others because we understand them better, 
because they hold out more promise for immediate testing, because they 
are more closely related to some earlier studies, or for any one of many 
other reasons. From such factors our thinking takes shape and is chan- 
nelized in reference to some particular phase. Gradually there emerges a 
framework for a specific study. 

The Value of Asking Specific Questions. Asking specific questions serves
the very useful purpose of bringing out detailed meanings from which a
particular problem can be evolved. The general theoretical framework that
we have formulated serves as a backdrop against which many individual
issues and hypotheses can be developed. Eventually we want to develop a
particular problem for intensive study. We can ask ourselves many specific
questions which concern special and detailed aspects of our original
problem and which therefore will aid in accentuating the issues out of
which a special problem can emerge.

Many questions will arise during t 
we must lose sight of the “forest” because our attention is focused on the
“trees.” Nothing prevents us from organizing a topic which will contain 
many individual problems and which will therefore be realized piece- 
meal by many separate inquiries. 

The possibility may arise of restricting the scope of our problem to 
such narrow limits that we do not get satisfactory answers to the ques- 
tions that we have evolved. Here the trees are obstructing part of the 
forest that is essential to our purposes. We should be particularly con- 
cerned with any restriction that so severely limits the theoretical develop- 
ment of the special problem that it is difficult for us to move out in 
theory toward other problems in the larger theoretical framework. 

Describing the Specific Problem. The description of any problem re- 
quires that the relationships among the concepts and facts be repre- 
sented in understandable symbols that carry meanings that are unam- 
biguous both to the scientist himself and to others who are capable of 
learning about them. This step of framing the problem in verbal terms 
deserves careful attention. It is a most difficult task. 

The Contents of the Description. In the process of tracing the theoreti- 
cal structure of the constituents of a special problem, many meanings are 
evolved. These must be incorporated in the description of the problem.
There should be a description of the determinate, indeterminate, and 
unknown elements. There should be a description of the theoretical 
security of all of the relevant constituents, whether determinate or in- 
determinate. There should be an accurate tracing of all lines of relation 
as they extend between secured and unsecured elements, and between 
determinate, indeterminate, and unknown constituents. When possible, 
all constituents should be described within one framework of theory. 

We should describe all explanatory concepts that bear on the problem. 
Rather exact explanations of some of the constituents should be possible. 
The unknowns can be only alluded to. By tracing the theoretical lines 
from a known to an unknown, however, it may be possible to describe 
the expected characteristics and relationships of unsecured and unknown 
constituents. 

In our description we should include all factual information. We 
should support our lines of theoretical relation with whatever evidence 
is available. Our predictions of the characteristics of the unknowns must 
be related to whatever facts are known. 

The Accuracy of the Description. Our statements should be as accu- 
rate as we can make them. Word symbols are not single-meaning sym- 
bols. We are confronted with the task of forming patterns of words for 
the purpose of transmitting certain precise meanings to individuals who 
do not have the relevant background and experience that we presumably 
possess. Singleness of meaning is a primary objective.

No vaguely described inquiry is possible of solution. If we do not
describe the problem accurately, we cannot expect to discover an adequate
solution. When the problem is stated precisely and accurately,
the nature of the observations required for investigating it is implied
in the description. If we know the nature of the observations demanded 
by our problem, it is then but a short step—albeit sometimes a difficult 
one—to devise the procedures through which these observations can be 
made. 

The Comprehensiveness of the Description. The theoretical and factual
analyses are conducted to detect and describe all of the angles, phases, 
aspects, conditions, factors, variables, and the like in any way thought 
to be associated with our problem. Somehow we must get all of these in- 
corporated into the description of the problem. Obviously the procedure 
is not one of enumerating them but rather one of placing them in relation 
one with another in terms of the theoretical framework.

We cannot overemphasize the fact that scientific genius in solving any 
problem is not the result of lucky inspiration, nor is it born from expos- 
ing the problem to a brilliant but vacuous mind. It depends upon a rich 
store of conceptual and factual knowledge arranged comprehensively in 
terms of an over-all theoretical framework. It is this which must be 
transformed into verbal symbols and preserved for future study and 
testing. 


A CONTINUAL INTERCHANGE DEMANDED BETWEEN 
THEORY AND FACT 


We have suggested two very significant phases connected with the 
framing and formulation of a problem: first, the establishment of a the- 
oretical framework, and second, the testing of this framework against 
available fact. The suggestion to start the theoretical phase first was 
made in order to establish conceptual meanings that could be used to 
direct us in the search for facts. Following this it was suggested that a 
thorough search of current knowledge should be made to increase our 
understanding of the factual bases of the theoretical framework and to 
discover ways of completing and extending this framework. 

The two phases were kept apart primarily for purposes of exposition. 
Actually, in any given problem it is difficult in our own thinking to tell 
whether some explanatory notion or some item of fact came first. Systematic 
treatment of theory should be started first, but facts should follow 
immediately. After we are well embarked on the consideration of a 
problem, theory and fact become inextricably interwoven. They evoke 
each other and are to be developed interdependently. During any 
study, theory should be introduced at any time we are intelligent enough, 
or fortunate enough, to have ideas about the determinants underlying 
our problem. There is a continual interchange between theory and fact 
as our knowledge about the problem expands and takes form. We should 
then proceed to get all of the facts we can and to apply these facts 
continually as a means of developing sound theory. 

As pointed out before, in defining a problem we are working at the 
hazy, indefinite, and indistinct border between the known and the un- 
known. Ideas for bridging the gap between the two regions seldom will 
appear crystal clear when first recognized. Notions that show no promise 
when first evolved may later prove highly significant. Ideas that seem so 
obvious that they must work very frequently turn out to be “duds.” 
These are common occurrences encountered while seeking to evolve a 
problem. In embarking on our search we are better off if we adjust our- 
selves early for a protracted study, one punctuated here and there with 
false clues, but one which, if we are willing to keep going, will eventually 
furnish the information needed to crystallize a particular problem worthy 
of thorough study. 

CHAPTER 8 


The Use of Hypotheses in Formulating a Problem 


One of the most productive verbal forms to be used in expressing a 
problem is that of the hypothesis. Stating the problem as a hypothesis 
minimizes exposition and leaves the essential elements framed in a brief 
statement. A hypothesis describes a possible future event or condition 
to be discovered. Thus, the problem is not framed in terms of its ante- 
cedents but rather in terms of its implications for future knowledge. In 
the present chapter we shall discuss the use of hypotheses in formulating 
a problem. We shall consider the nature and functions of hypotheses and 
the logic and method used in subjecting them to empirical testing. 


THE NATURE OF HYPOTHESES 


A Definition. A hypothesis is a proposition about factual and con- 
ceptual elements and their relationships that projects beyond known 
facts and experiences for the purpose of furthering understanding. It is a 
conjecture or best guess which involves a condition that has not yet been 
demonstrated in fact but that merits exploration. It may be framed as a 
potential solution to a problem, or as an explanation of some unknown 
fact. It may describe an element or a relationship which, if found true, 
would by logical inference offer support to some explanation or theory. 

This definition restricts the meaning of the term hypothesis. Not just 
any kind of statement of a problem is to be interpreted as meeting this 
definition. Let us consider an example. If we were interested in the un- 
derlying nature of color vision, we could phrase our ideas in the fol- 
lowing statement: Color responses are inherited reactions. This state- 
ment is not a hypothesis. It does not express a specific condition, rela- 
tion, or element to be explored; it does not state a solution to be tested; 
it does not juxtapose factual and conceptual objects in some definite re- 
lationship. 

Suppose we state the problem as: Color responses result from the 
activity of different types of cones in the retina. This is a more precise 
statement; it suggests specific conditions that can be explored. It is based 
on the fact that there are retinal cones, and it conceptualizes about the 
functions which these organs might possess. As a best guess it states that 
seeing different colors might be the result of the activity of different 
kinds of retinal cones. 

The Conceptual Nature of Hypotheses. An important characteristic 
of a hypothesis is that it is developed by reasoning and so contains con- 
ceptual elements. Starting from known events and relations we think of 
new possibilities. These may include possible new objects, possible new 
functions, possible new relations—literally, the possible existence of any 
kind of aspect that is known to us or can be imagined by us. It is appar- 
ent that reasoning may know no bounds when it is being applied in the 
formulation of a hypothesis. 

The new aspects of any hypothesis are conceptual in nature. In some 
problems the elements of the hypothesis are perceptual in nature, the 
postulated relation between the elements being conceptual. In other 
problems the elements themselves are conceptualized. The point is that 
every hypothesis contains some element or relation that has never been 
perceptually experienced. To be sure, the conceptualized aspect has been 
formed out of the experiences of the past, but it may vary so radically 
from known objects or relations as to bear little similarity to them. In 
fact, the conceptualized aspect may never exist in the sense of being 
experienced perceptually. Its acceptance, then, is based on relevant facts 
from which the existence of the conceptual aspect is inferred. 

In our example from the field of color vision, there are two known 
or “perceptualizable” elements: the fact of seeing different colors, which 
lies within the capacity of almost all individuals, and the existence of 
retinal cones, which is accepted as a characteristic of the retinas of all 
but a few individuals. The conceptualized aspect is the postulation of 
different cones for different colors. Through reasoning we formed the 
notion that different color sensations result from the functioning of more 
than one type of cone. We did not specify the number of types of cones 
to be expected. Actually, of course, this hypothesis has been the subject 
of study, and several sound empirical tests have been made of it. 

Let us consider another hypothesis. It is an accepted fact that the rods 
of the retina mediate colorless sensations—the black-gray-white sequence 
of sensations. It is known that the rods are activated by chemical changes 
produced by light in a substance called rhodopsin or visual purple. This 
substance completely surrounds the rods. Suppose we are interested in 
knowing how the cones are activated. Reason would dictate that we as- 
sume the existence of a substance surrounding the cones comparable to 
rhodopsin. Our hypothesis might be: The cones are activated by chemi- 
cal changes in a substance that surrounds them. Here we are conceptual- 
izing the nature of the processes that produce cone activity. We postulate 
a substance surrounding the cones. We further postulate that it is this 
substance that is changed by light and that in turn activates the cones. 
Again, several studies have been made exploring this hypothesis. To 
date, however, the evidence is not as complete as the evidence available 
on rod functions, and therefore additional studies are in order. 

Verbalization of Hypotheses. Hypotheses vary in the extent to which 
their elements and relations can be accurately depicted. The accuracy 
of the description depends upon the degree to which the postulated 
aspects can be traced to relevant existing facts. 

Sometimes we may entertain implicit, nonverbalized hypotheses, that 
is, hypotheses that we have not thoroughly developed and therefore can- 
not accurately express. If we attempt any experimentation with such a 
hypothesis, however, we shall be working in the dark. The anticipation 
of obtaining significant results from our efforts is predicated more upon 
hope than upon knowledge. A hunch is not a hypothesis. It is the sub- 
jective beginning of an idea, and must undergo considerable analysis 
and development before it can be called a hypothesis. Through theoreti- 
cal and factual analyses we must objectify the hunch. We must determine 
its constituents and associate these constituents with all facts known to 
be relevant to them. This process of objectification enables us to separate 
those elements or relations that can now be accepted as factual and those 
that, being largely conceptual, must be further studied and tested. 

Sometimes we may devise a hypothesis prematurely, that is, we may 
verbalize our hunch before completing our analysis of the problematic 
situation. This undue haste results in error. As a consequence of our 
premature verbalization we may restrict too greatly the range of our 
investigation. We may develop a biased interpretation and attack. We 
may fail to detect and control all of the determinant variables. We may 
waste time and effort in following false leads and collecting data of little 
consequence to the primary issues of our problem. 

The accuracy of verbalization of hypotheses directly depends upon 
the accuracy with which we objectify the elements and relations of the 
problematic situation. Obviously, as the elements and relations of our 
problem become more and more objectified, they lose their indefinite- 
ness and ambiguity. Gradually the hypotheses needing study crystallize 
in a form that can be accurately verbalized. Eventually, if we persevere 
in our analysis, we reach a point where our understanding is sufficient 
to enable us to describe hypotheses that contain testable theorems. 

Let us refer to our example on color vision. Suppose we formulate a 
hypothesis about the function of the cones before we collect all available 
pertinent facts. We have learned that the rods and cones differ in certain 
anatomical features. They differ in shape, size, distribution in the retina, 
and in the nature of their neurological connections with the cortical 
visual centers. With only these facts before us, suppose we prematurely 
formulate a hypothesis as follows: The function of the cones is different 
from the function of the rods. Very likely this statement is true, because 
of the anatomical differences just noted, but we framed the hypothesis 
before we were aware of all of the factual and conceptual elements and 
relations. The hypothesis as stated is ambiguous and indefinite. It does 
not specify some condition that can be empirically explored. 

We now continue our search for facts. We learn that the retinas of 
nocturnal animals contain mainly rods, few cones. From this we reason 
that because we do not see colors in dim illumination, for example, at 
night, probably these animals do not see colors at all. Their visual experi- 
ence might be restricted to brightness vision. Continuing our search we 
learn that occasionally a human being is found who is completely color- 
blind. This individual experiences pain in bright illumination; he has 
what might be called “daylight blindness.” At night, however, his vision 
is good. We now recall one of our first learned facts, that cones are found 
in large numbers in only the central portion of the retina. To this we 
add a related fact that through reflex action the pupil of the eye is re- 
stricted in bright illumination so the light rays tend to reach only the 
central region of the retina. At this point in our thinking we reintroduce 
the facts that we learned about the vision of nocturnal animals. From 
all of these facts we now reason that the totally color-blind individual 
must have a peculiarly constructed retina, probably mainly rods, because 
his visual reactions resemble those of the cone-free nocturnal animals. 

With the addition of the few facts given above we are now in a posi- 
tion to improve our hypothesis. We could set up the hypothesis that 
cones are the retinal organs for seeing color. We could then set up 
theorems to collect facts relevant to this hypothesis. One theorem would 
be that animals like the owl cannot discriminate colored objects. An- 
other theorem would be that totally color-blind humans have a cone-free 
retina. The new hypothesis is tied to more facts than our original hypoth- 
esis. It is more highly objectified and less ambiguous than our original 
hypothesis. It specifies conditions that can be formed into empirically 
testable theorems. 

The Forward Reference of Hypotheses. A hypothesis has a forward 
reference in the sense that it contains a conceptual element or relation 
that requires further examination. It is necessarily a rational leap be- 
yond known facts and experiences. 

In everyday life we carry knowledge forward and apply it to situations 
similar in nature to those in which the knowledge was discovered and 
developed. Much of the time we do this without verbalizing about it 
and without explicitly framing any hypotheses. As the new situations 
vary more and more from familiar past situations, a given application is 
less likely to succeed. We then become more and more aware of the need 
for further analyses by which we can determine the nature of the situa- 
tions in which we can reasonably expect the application to succeed and 
the limits beyond which we can expect it to fail. At this point we are in 
need of a hypothesis. Through a hypothesis we can set up a situation for 
empirically establishing the limits within which we can expect success. 

A hypothesis, then, anticipates nature and proposes certain conditions 
that might be found to exist, but that do not now exist so far as we 
know. It is a guess that needs to be explored. It must be demonstrated 
that the conceptual aspects of the hypothesis are supported by empirically 
derived facts. We set up an investigation to test the implications of the 
hypothesis and thus to collect evidence concerning these conceptual 
aspects. 


THE FUNCTIONS SERVED BY HYPOTHESES 


In the following discussion we should keep in mind the fact that there 
are inadequately formulated, overgeneralized, and logically unsound 
hypotheses which do not fulfill the accepted functions of a hypothesis. 
The statements which follow apply to hypotheses that have evolved from 
a thorough and comprehensive analysis of the problem; an analysis simi- 
lar to that which was described in the preceding chapter. 

Hypotheses as Explanations. One of the most important functions 
served by a hypothesis is that of explanation. The effectiveness of a 
hypothesis as an explanation is judged in terms of the significance of the 
meanings that it introduces into the situation containing the unknown 
factors. 

Let us consider this explanatory function. The resolution of a problem 
requires postulation of conceptual elements and relationships. Examina- 
tion of the observable data has shown them to be ambiguous and incom- 
plete. There are elements that appear unordered and unrelated to any 
other elements that are known. There are unknowns for which there are 
no satisfactory meanings or interpretations. The tasks needing to be done, 
then, are to complete the data, to detect the potential meanings and 
relationships, and to order the phenomena. These are the explanatory 
functions served by hypotheses. 

Hypothetical explanatory elements and relations are the principal 
means for setting up trial solutions for a problem. Conceptualizing makes 
possible the introduction of elements and relations not directly observ- 
able, thus enabling us to proceed beyond the known data. A hypothesis 
may contain conceptual elements that complete the observable phe- 
nomena, conceptual relationships that organize the unordered elements, 
or conceptual meanings and interpretations that are applicable to the 
unknown factors. 

Hypotheses as Stimuli to Research. Another general purpose served 
by the hypothesis-making function concerns the origination of problems 
for research study. Hypothesis making literally becomes a mental char- 
acteristic of the scientist—a kind of habit of thought. It serves as a gen- 
eralized frame into which he can thrust all of his uncertainties and diffi- 
culties, and he does just that. We must grant that scientific method is 
primarily in the hands of specialists. In serving the specialist in science, 
the hypothesis-making function is a device for creating and crystallizing 
problems for investigation. As a type of mental set it makes him receptive 
to problematic situations. His habitual use of the function then enables 
him to see problems that otherwise would escape his notice. 

Hypotheses as Sources of Methodology. In the formulation of a hy- 
pothesis, procedures are often suggested that can be used in the empirical 
testing of the postulated solution. Frequently the hypothesis is phrased 
as a conditional sentence the consequences of which are pertinent to the 
solution. In phrasing the conditional statement we are led to examine the 
pertinent variables. We are also led to methods for empirically testing 
the points at issue. These methods are usually revealed by simply fol- 
lowing the implications that arise from the hypothesis. 

Let us do some reasoning about the accident-prevention problem and 
see how methods are suggested by the hypotheses we develop. It was 
suggested earlier that fatigue might be a determiner of accidents. If this 
were true, then more accidents should occur when drivers are fatigued. 
In considering this idea we are led to the possibility of collecting acci- 
dent statistics at the times drivers are fatigued. We would probably 
conclude that for most drivers the greatest fatigue comes at the end of the 
working day. This suggests getting accident statistics between the hours 
of 4:30 and 6:30 p.m. After a little thinking about this idea we would 
realize that there might be factors in addition to fatigue that would 
cause accidents during this period, such as congestion due to workers 
returning to their homes, or lowered visibility in the wintertime during 
these hours. In formulating the empirical testing situations, we would also 
want to collect accident statistics when traffic was congested at hours 
other than between 4:30 and 6:30 p.m., and we would control visibility 
by not using accident cases that could be attributed to this factor. If we 
did find more accidents during the hours from 4:30 to 6:30 p.m. than 
during any other 2-hour period of the day, we would want to be assured 
that the greater frequency during the evening hours was caused by 
fatigue of the drivers and not by congestion or lowered visibility. With 
these ideas in mind we might continue our thinking and turn up other 
factors that would differentially affect the driving picture at these hours. 
Following this, we would seek means for eliminating or controlling these 
factors in the study we anticipated making. It should be noted that in 
this brief period of reasoning, various methodological suggestions have 
appeared as “part and parcel” of our thinking. 

Hypotheses as Criteria for Evaluating Experimental Techniques. A 
hypothesis often sets conditions against which we can judge the appro- 
priateness and adequacy of the instrumental or statistical procedures of 
our empirical test. When the solving of a problem requires the use of 
apparatus, we must have some means of determining when the apparatus 
is adequate for the solution. We really have only one criterion and that 
is the hypothesis we are going to study. If the apparatus enables us to 
collect data in the form required for checking the implications of the 
hypothesis, then it is to be considered adequate. This logic is not re- 
stricted to evaluating apparatus but is applicable to all procedures and 
operations, including any statistical designs we may need to employ. 
Any procedure is declared appropriate and sufficient when the ends 
served by it satisfy the conditions of the hypothesis. 

Let us consider an example. One of the hypotheses suggested in the 
accident problem concerned glare blindness. Inability to recover rapidly 
from bright light stimulation might contribute to accidents. It is only a 
short logical step from this notion to the idea of examining the glare- 
recovery times of all drivers who have accidents at night. This, of course, 
presumes that we can obtain or devise an apparatus for measuring the 
glare-recovery time. The hypothesis requires that we be able to measure 
reliably individual differences in time of recovery from glare stimulation. 
An apparatus that does this will meet this particular condition of the 
hypothesis. 

Hypotheses as Organizing Principles. A hypothesis serves as an or- 
ganizing principle around which all pertinent knowledge can be re- 
lated. Thus it aids in establishing the relative emphases to be placed on 
different aspects of the problem. It contributes to the determination of 
the temporal direction that any effort toward a solution will take. 

The hypothesis, as an organizing principle, aids in determining the 
needed coverage of the several phases of a problem. The scientist, like 
all other workers, needs some principle to follow that will enable him 
to tell when his task is finished; that is, to tell when he has collected 
a sufficient number of facts to test the solution adequately. He might 
stop his experiment prematurely or he might continue it longer than 
necessary. If the hypothesis is stated too generally and indefinitely, the 
scientist never knows when he has collected a sufficient amount of data 
for empirically testing its implications. When the hypothesis is so stated 
that its implications are amenable to statistical evaluation, then the scien- 
tist can determine rather precisely when he has enough data collected. 

In the example in which we postulated fatigue as a determiner of auto- 
mobile accidents, it was suggested that other factors would have to be 
brought under control. Accepting fatigue as the experimental variable 
establishes an organizing principle through which the importance of these 
other factors can be determined. To a very significant degree it deter- 
mines what action should be taken in reference to time of day, conges- 
tion, speeding, and similar factors. 

If we make our fatigue hypothesis more explicit, we can take advantage 
of statistical procedures devised for quantitatively evaluating hypotheses. 
This requires that we derive a theorem that can be quantitatively tested. 
Such a theorem might be stated as follows: More accidents will occur at 
the intersection of F and K streets between the hours of 4:30 and 6:30 
than between the hours of 11:30 and 1:30, when the volume of traffic is 
approximately equivalent. This theorem can be accurately evaluated by 
conducting an empirical test and statistically evaluating the results. It 
will be noted that in deriving a theorem amenable to measurement we 
substituted accidents at stated hourly periods for the concept of “fatigue” 
as the object of immediate concern. Should the findings support the 
theorem, it would still be necessary for us to establish as fact the rela- 
tionship that the greater accident frequency during the afternoon period 
arose from the greater fatigue of the drivers. 
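
One way the statistical evaluation of such a theorem might proceed is sketched below. The sketch assumes that, with traffic volume equal in the two periods, each accident is equally likely to fall in either period if the theorem is false; the afternoon count is then compared against that expectation with an exact one-sided binomial test. The accident counts and the function name `binomial_upper_tail` are invented for illustration and do not come from any actual study.

```python
from math import comb


def binomial_upper_tail(successes: int, trials: int, p: float = 0.5) -> float:
    """Probability of at least `successes` hits in `trials` independent draws,
    each with hit probability `p`."""
    return sum(
        comb(trials, k) * p**k * (1 - p) ** (trials - k)
        for k in range(successes, trials + 1)
    )


# Hypothetical counts at one intersection, invented for illustration only.
late_afternoon = 30  # accidents between 4:30 and 6:30
midday = 15          # accidents between 11:30 and 1:30

# If the two periods were equally hazardous, each accident would fall in the
# late-afternoon period with probability 0.5.
p_value = binomial_upper_tail(late_afternoon, late_afternoon + midday)
print(f"one-sided p-value: {p_value:.4f}")
```

A small p-value would support the theorem about the hourly periods only. As noted above, it would not by itself establish that fatigue, rather than some other factor peculiar to the late afternoon, produced the difference.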


HYPOTHESIS IN RELATION TO FACT, THEORY, AND LAW 


Hypothesis and Fact. Knowledge in the form of description comes 
from studying the facts; knowledge in the form of explanation comes 
from the substantiation of hypotheses involving conceptual elements and 
relationships. Although this statement expresses the relationship between 
facts and hypotheses as knowledge-gaining mechanisms, it fails to point 
out that at times it is difficult to distinguish the one from the other. 
There are times when the content of a hypothesis approaches the status 
of a fact, and times when the substance of a fact approaches the status 
of a hypothesis. 

In the beginning, a hypothesis contains unobservable conceptualized 
elements and relationships. In this form it serves as an explanatory 
mechanism. Empirical evidence supporting the hypothesis may reveal 
the actual existence of the elements and relationships that are postu- 
lated, and thus they are accepted as facts. This transition from hypothesis 
to fact usually is not accomplished in any single study, and it may require 
a large number of investigations and take place over the span of many 
years. 

At any time, however, these conceptualized elements and relationships 
now accepted as facts can be brought into question, if new evidence 
serves to discredit them. Thus, what once was considered fact may no 
longer be accorded this status. The former facts, however, may continue 
to be accepted as the most likely hypothetical explanations currently 
available. 

It is obvious that with facts changing to hypotheses and hypotheses 
changing to facts there is not always a clear distinction between the two. 

One of the suggested hypotheses concerning color vision was that 
there is a substance surrounding the cones that activates them when it is 
chemically changed by light rays. The existence of this chemical sub- 
stance was purely hypothetical when the idea was first enunciated. Since 
then, considerable evidence has been collected that supports the con- 
tention. Today, many workers in the field of color vision maintain that 
the cone substance should now be accepted as fact. 

Hypothesis and Theory. A distinction can be made between the two 
terms hypothesis and theory even though they are often used inter- 
changeably. A hypothesis and a theory are alike in that they both are 
conceptual in nature and have as their primary function the explanation 
of natural events. The difference between the two concerns the extent 
or complexity of the area encompassed. Hypotheses are more restricted 
in their coverage. Theories are more general in nature. A theory may 
contain several hypotheses. When there are several interrelated areas of 
phenomena to be explained, more than one hypothesis may be necessary 
to account for all of the varied phenomena. The several hypotheses are 
usually mutually compatible and supplementary and can be fitted to- 
gether into a more inclusive and comprehensive conceptual scheme of 
a theory. 

Hering’s theory of color vision contains a number of hypotheses. He 
postulated the existence of three retinal processes underlying visual ex- 
perience, one process for red and green, one for blue and yellow, and 
one for black and white. He postulated two chemical phases in the proc- 
esses; one he called anabolism (building up), and the other he called 
katabolism (tearing down). He postulated that the anabolic phase gave 
rise to green, blue, and black, and that the katabolic phase gave rise to 
red, yellow, and white. He postulated further that color mixtures and 
blends resulted from the simultaneous activity of two or of all three of 
the processes, either in the anabolic or katabolic phases. Further hy- 
potheses concerning the retinal processes were suggested to account for 
the phenomena of color blindness, peripheral color sensitivity, negative 
and positive afterimages, and other visual experiences. 

Hypothesis, Theory, and Scientific Law. Scientific law differs from 
both hypothesis and theory in that it has received sufficient verification 
and confirmation in fact to be accepted with little question. Like hy- 
pothesis and theory, law retains the explanatory function. In backward 
reference, it is used as a means of accounting for the occurrence of phe- 
nomena. In forward reference it is used as a means of predicting what 
is to occur. 

Before attaining its current status, a law was either a hypothesis or a 
theory. Its continued acceptance as a law is predicated upon its continued 
success in accounting for the events that it is designed to encompass. 
It may be called into question when new findings are found not to con- 
form to its tenets. It may then revert to the status of a hypothesis or 
theory, and its acceptance will be continued in terms of its usefulness as a 
hypothetical explanation. If sufficient negative evidence arises to demon- 
strate the law to be false, it may be completely abandoned. 


FACTORS CONTRIBUTING TO THE ORIGINATION 
OF HYPOTHESES 


Originating a hypothesis is one phase of the scientific method that 
must be learned through hard work and the application of trial-and-error 
procedures. We can be dogmatic to the extent of saying that there are no 
prescriptions which, if meticulously followed, will guarantee success to 
the uninitiated. We can state arbitrarily that a hypothesis cannot be 
“conjured up” in a vacuum. 

Generalizing beyond Results of Previous Investigations. Sometimes 
hypotheses are readily formulated by generalizing beyond the findings of 
previous studies. The procedure may consist in applying the logic of simi- 
larities and inferring that because some principle X worked for condition 
A it therefore should work for condition B, which has many of the char- 
acteristics of condition A. For example, rhodopsin having been found 
surrounding the retinal rods, we could postulate that there should be a 
similar substance surrounding the retinal cones. 

The procedure of generalizing beyond previous findings may consist 
in applying the logic of differences. We infer that principle X, which 
worked successfully for condition A, must be modified before it will work 
for condition B, because condition B differs from condition A in certain 
important respects. For example, it has been found that the cortical 
brain in rats is undifferentiated in respect to the problem of maze learn- 
ing. That is, no part of the cortex is more important for the maze-running 
habit than is any other part. The cortex is equipotential for this habit. 
For the problem of brightness discrimination, we probably should not 
think of all parts of the cortex as equipotential because there is a specific 
visual area in the cortex of most mammals. The hypothesis could be re- 
stricted to the cortical visual area instead of the whole cortex. The hy- 
pothesis then would state that within the cortical visual area the brain 
is undifferentiated in respect to the habit of brightness discrimination. 
This hypothesis has been confirmed by several experimental studies. 

Analyzing Factual Conditions Needing to Be Explained. Hypotheses 
may often be formulated from thinking about possible determiners of 
some factual condition for which there is no satisfactory explanation. In 
any area in which we desire to construct hypotheses we must be aware 
of both the gaps that exist in our knowledge and the theoretical security 
and boundaries of our facts. Thorough acquaintance with the gaps and 
with the relevant theoretical background makes more probable the dis- 
covery of possible explanations. We cannot expect conceptual discoveries 
from vacuous minds.

In the accident problem, we do not have a satisfactory explanation for
the fact that about 50 per cent of automobile accidents occur at night, 
although only one-third of automobile traffic occurs after daylight hours. 
Examining the problem further, we might be led to such factors as 
sleepiness, lowered visibility, fatigue, glare blindness, faster speeds, etc., 
as possible explanations. Any one of these factors could be stated in the 
form of a hypothesis and set up as a potential explanation for the high
frequency of night accidents.
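
The disproportion in these figures can be made explicit with a little arithmetic. The sketch below is only an illustration: the function name is our own, and it takes the stated shares (about one-half of accidents, one-third of traffic) as exact fractions in order to compare accidents per unit of traffic by night and by day.

```python
from fractions import Fraction


def relative_night_risk(night_accident_share, night_traffic_share):
    """Accidents per unit of traffic at night, divided by the same rate by day."""
    night_rate = night_accident_share / night_traffic_share
    day_rate = (1 - night_accident_share) / (1 - night_traffic_share)
    return night_rate / day_rate


# Shares taken from the text: one-half of accidents, one-third of traffic.
ratio = relative_night_risk(Fraction(1, 2), Fraction(1, 3))
print(ratio)  # 2
```

On these figures a unit of night traffic carries about twice the accident rate of a unit of day traffic, which is the factual condition the hypotheses are meant to explain.
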

The Intellectual Equipment of the Scientist. The hypothesis-making
function depends for success upon the personal qualifications of the
investigator. Keenness of observation, ability to detect potential
relationships, breadth and intensity of imagination, facility for
manipulating concepts, etc., are aspects of intellectual equipment that
every scientist must cultivate as essential to his profession. When
appropriately applied, these intellectual qualifications result in greater
numbers of hypotheses and in more fruitful hypotheses.

Inspiration as a Source of Hypotheses. Little credit should be given to
this method as a means of evolving hypotheses. If we acquaint ourselves
with the methods of the great scientists who have claimed the sudden
appearance of hypotheses in dreams during sleep or in flashes of insight
at some seemingly inappropriate moment, we shall find that they advo-
cate much the same procedure we have suggested in the theoretical and
factual analyses of the problem as described in the preceding chapter.
One fact, of which we can be fairly certain, is that the wild guesses of
the unsophisticated are not the high-quality hypotheses that mature men
of science have claimed they have received suddenly, as by some inspira-
tion. The insightful flashes of the immature and uninitiated have no claim
for immediate verification. Their empirical testing should wait upon
thorough theoretical and factual analyses of the problem.


THE DEDUCTIVE ELABORATION OF A HYPOTHESIS


There are five stages in the development and the confirmation or dis- 
confirmation of a hypothesis. (1) We start with empirical facts, with data 
that are in need of explanation and further understanding. (2) From 
these data, through a procedure of rationalizing, we derive a hypothesis, 
a conceptual explanation that helps us to understand the data. (3) We 
then evolve from the hypothesis implications that show promise of being 
amenable to empirical study and that can be stated as theorems having 
certain consequences. (4) Next, we devise factual situations that will test 
the theorems. (5) Lastly, we conduct the empirical tests, collect the 
relevant facts, and confirm or disconfirm the hypothesis in terms of 
these facts. In these several stages we complete a cycle from the initial 
facts through the theoretical development of the hypothesis to the collec- 
tion and evaluation of additional facts. The deductive elaboration of the 
hypothesis is begun in stage 2. It is the primary task of stage 3. 

Separating the development and confirmation-disconfirmation of a hy- 
pothesis into five stages assists in gaining an understanding of what goes 
on in the scientist's thinking when he evolves a hypothesis. It is to be 
understood that these stages are not always clearly recognizable. Further, 
there is not full agreement in the use of the terms we have adopted for 
representing these stages. Many scientists do not restrict the word hy- 
pothesis to stage 2 but use it loosely to stand for the implications and 
theorems, stage 3, and sometimes to stand for the empirical testing situa- 
tion, stage 4. The authors believe that a more rigorous use of the term is 
needed, and that not just any kind of problem that proves amenable to
study should be dignified by the word hypothesis. 

The Meaning of Deductive Elaboration. This is the step in which, by 
realistic and incisive thinking, we discover the bridges between the 
known and the unknown. It is both one of the most important and one 
of the most difficult individual procedures in the execution of the scien- 
tific method. 

In deductive elaboration we endeavor to detect and develop as many 
implications as we can logically evolve from the hypothesis, and to frame 
them in the form of theorems. Every hypothesis is based upon certain 
postulates and assumptions which we should examine carefully for im- 
plications. The assumptions and postulates will imply certain elements 
and relationships that presumably must exist if the hypothesis is valid. 
These elements and relationships issue logically from the hypothesis. 
They can be stated in the form of theorems having certain consequences.
We bring evidence to bear on the hypothesis through a study of these 
theorems. 
A simple formalized procedure that can be utilized is as follows: If
postulate A is true, then under condition M certain X consequences will 
occur. The phrase “if postulate A is true” refers to an assumption of the 
hypothesis. The phrase “under condition M certain X consequences will 
occur” is a theorem. 
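
The logical force of this form can be checked mechanically. The following is a minimal sketch, not part of the procedure itself: it treats condition M as held fixed, writes the rule as "A implies X," and enumerates every truth assignment to see which conclusions follow. The helper names `implies` and `entails` are our own.

```python
from itertools import product


def implies(p, q):
    """Material implication: false only when p is true and q is false."""
    return (not p) or q


def entails(premises, conclusion):
    """True if the conclusion holds in every assignment satisfying all premises."""
    return all(
        conclusion(a, x)
        for a, x in product([True, False], repeat=2)
        if all(premise(a, x) for premise in premises)
    )


# "If postulate A is true, then (under condition M) consequences X will occur."
rule = lambda a, x: implies(a, x)

# The consequences fail to appear: postulate A is thereby refuted.
print(entails([rule, lambda a, x: not x], lambda a, x: not a))  # True

# The consequences appear: postulate A is supported, but not established.
print(entails([rule, lambda a, x: x], lambda a, x: a))  # False
```

The asymmetry is worth noting: a failed consequence counts against the postulate with logical necessity, whereas an observed consequence leaves open the possibility that something other than A produced it.
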

The significance of any hypothesis is to be found in its implications.
It is through its implications that the hypothesis “reaches out” toward
the unknown. The theorems and their consequences, which are logically
deduced from the implications, serve to force the implications into a
form that can be empirically examined. The hypothesis itself is not di-
rectly tested; rather, it is attacked indirectly through a study of its
theorems. We can use the facts collected about the theorems as evidence
for evaluating the hypothesis only if the relationship between the theorems
and the implications and the relationship between the implications and
the hypothesis are logically sound.

Let us consider the factor of glare vision in our problem of reducing
automobile accidents. Some of the facts from which we could initiate
further studies are that a large proportion of automobile accidents occur
at night, that at night the driver is temporarily blinded by the bright
headlights of oncoming cars, and that people differ markedly in their
rates of recovery from bright light stimulation. From these facts we
might evolve the hypothesis that slow recovery from glare stimulation
is a determiner of night accidents. From this hypothesis we can derive
the implication that drivers with slow rates of recovery might be re-
sponsible for a disproportionate number of night accidents. We can
phrase this implication as a theorem as follows: A study of night accidents
would reveal that a disproportionate number of drivers would report
that they did not see distinctly just preceding their accident because of
the glare from the headlights of oncoming cars. Whether or not we could
substantiate this theorem would depend upon the results obtained in
subsequent factual studies.

The Conceptual Nature of the Deductions. How much it is necessary
to postulate nonobservable elements and relationships in a hypothesis
depends upon the nature of the problem and the kind of solution that is
desired.

Sometimes the solution may be obtainable with little conceptualization.
Some implications developed from a hypothesis may be confirmed from
a reconsideration of past observed events or by new evidence readily ob-
tainable. Thus, the implications may issue in consequences that can be
quickly and directly studied. The new elements or relationships required
are revealed in immediate experience. In such instances the new postu-
lated elements and relations required by the hypothesis are closely asso-
ciated with factual and theoretical elements and relations that are known.

When immediate empirical evidence is found to support a hypothesis,
our grasp of the problem is greatly strengthened. Such evidence may even 
provide the solution to the problem. In such a case, the hypothesis does 
not attain a high level of abstraction and the conceptual elements are 
not far removed from empirical phenomena. In the example dealing with 
glare recovery as a cause of accidents, the conceptual elements were not 
far removed from available facts. 

The more difficult problems of deductive elaboration occur when the 
postulates and assumptions of the hypothesis involve elements and rela- 
tionships that are not directly observable with any known techniques, 
and, in fact, may never become directly observable. The postulated con-
cepts of the hypothesis then have meanings that are several steps re-
moved from empirical phenomena. Despite the difficulty of the task, we
must deduce theorems from these implications that will issue in empirical 
testing situations. Hering’s hypothesis concerning the anabolic and kata- 
bolic processes yielding the various colors has so far not proved amenable 
to experimental attack. 

Sometimes when a scientist is working on a large research program 
he will introduce highly abstract and conceptualized constructs into the 
deductive elaboration of a hypothesis when the solution in terms of im- 
mediately observable events is “begging” for his recognition. There is 
justification for this behavior. For most of us, the solution of the immedi- 
ate problem consumes all of our attention. For the theoretician, the prob- 
lem is one of intellectual curiosity. There is no urge for an immediate 
factual type of solution. The individual hypothesis is not his only concern. 
The hypothesis may be just one part of a larger theoretical scheme. The 
contribution to his over-all theory of a highly conceptualized solution of a 
single hypothesis may far outweigh an immediate factual solution. In 
other words, the scientist may postulate an elaborate system of con- 
ceptual entities and relations for the immediate problem because they are 
appropriate for other problems in his larger theoretical system. The im- 
mediate problem gives him an opportunity to get empirical evidence for 
this larger scheme. 

The Duration of the Deductive Elaboration. In our deductive analysis 
it is a good policy to continue the elaboration as long as we are evolv- 
ing what seem to be testable ideas about the problem. We should deduce 
as many implications as we can. Even what appears to be a rather remote 
implication may, through further knowledge, lead to significant results. 
Let us be reminded again that we are working in the region between the 
known and the unknown, and if we are fortunate enough to have numer- 
ous ideas about the implied consequences of the hypothesis, we certainly 
should describe as many of them as we can. A comprehensive deductive 
elaboration in which some of the theorems prove to be untestable is to be 
preferred to the procedure of immediately setting up an empirical test
for the first theorem that is evolved. 

The deductive elaboration of a highly significant hypothesis is a process 
that may go on for years, occupying the attention of one and then another 
investigator. Seldom does one individual achieve a comprehensive elab- 
oration and justification of all the implications of such a hypothesis. 

The Need for Thorough Theoretical and Factual Analysis. It should 
be obvious that before we begin the deductive elaboration of a hypothesis, 
we should make careful analyses of the theoretical and factual back- 
grounds of the problem. The framing of the hypothesis depends upon 
these analyses. Very likely the hypothesis we decide to investigate is the 
last of a succession of trial hypotheses that we have evaluated. The pro- 
cedure is mainly one of trial-and-error reasoning, and trial hypotheses 
should be evaluated in terms of logical consistency and logical pertinence 
before they are considered acceptable for further study. At this stage, 
trial hypotheses may be discarded because they are not logically con- 
sistent with the theory of the determinate factors of the problem, or be- 
cause they are not very pertinent to the questions needing to be answered, 
or because they offer little in the way of implications that can be in- 
cisively tested. Success in correctly choosing a significant hypothesis and 
in adequately elaborating the hypothesis chosen directly depends upon 
the thoroughness with which we have developed the theoretical and
factual backgrounds of the problem.


THE EMPIRICAL TESTING OF THE THEOREMS 


After completing the deductive elaboration of his hypothesis, an
investigator will sometimes proceed to use the hypothesis as an explanatory
device before he has found factual substantiation for the relevant
theorems. Such premature application is frequently made by the
nonscientific person and occasionally by the scientist. It is not to be condoned.

The Need for Empirical Testing. A hypothesis is not valid until there 
has been an opportunity to test the theorems against facts. It retains 
the characteristics of a guess until factual support is forthcoming. Its
use may give the investigator satisfaction, because it purportedly accounts
for bothersome unknowns, thus giving some relief from uncertainty. Or
its use may prove satisfying because it completes an explanatory system,
taking care of certain annoying loose ends. It should be realized, how-
ever, that “satisfyingness” is a matter of feeling and not a matter of fact.
Logical completeness or logical consistency do not produce facts; there-
fore, these processes cannot confirm hypotheses. Only an exposure of the
theorems of the hypothesis to empirical testing will confirm or disconfirm
the hypothesis.

The use of the concept of accident proneness as a cause of automobile
accidents is a good example of the utilization of an explanatory hypothesis 
before it has been adequately confirmed. Many workers in the field of 
accident prevention get great satisfaction from this concept because it 
comes in so handily in their discussions on why accidents happen. Its 
widespread use is not justified, however, because little success has been
achieved in subjecting the concept to empirical verification. 

The Nature of the Empirical Test. Formal logic plays an important 
role in the testing of theorems. In the deductive elaboration of the hy- 
pothesis we found that the implications must follow logically from the 
assumptions and postulations of the hypothesis and that the theorems 
must follow logically from the implications. In the present step we must 
be certain that the empirical test we organize logically satisfies the con- 
ditions of the theorem. It must relate the elements in the manner stipu- 
lated by the theorem. Of course, the test should measure the particular 
theorem under consideration and not some other theorem. In evaluating a 
hypothesis, we say that if we can demonstrate the truth of the theorem 
through empirical testing and the theorem is logically derived from the 
implications of the hypothesis, then we have evidence that supports the 
hypothesis. This appears to be the only possible approach when the as- 
sumptions and postulates of the hypothesis are conceptually removed 
from empirical observation. 

A check can be carried out on a theorem regardless of the state of our 
knowledge regarding the postulates and assumptions from which the 
theorem is derived. The postulates may be wild guesses, but as long as 
they issue in implications that eventually can be stated as theorems and 
subjected to empirical study, they may be productive and deserve em- 
pirical study. 

The empirical test can be of almost any variety, provided it furnishes 
factual material bearing on the theorem. The most important features 
of the testing situation are those that determine whether it accurately 
and adequately represents the variables, conditions, relationships, etc.,
of the theorem being tested. Only if this is true is the test a valid one for
the particular theorem. Representativeness can be achieved by many
different kinds of empirical situations. The experimental laboratory ap-
proach is one procedure. Other methods, such as controlled observation of
naturalistic situations and procedures involving psychological tests, in- 
ventories, interviews, surveys, and case histories, will also serve the 
purposes of an empirical test, provided they adequately represent the 
conditions of the theorem. 

The Procedure for Devising an Empirical Test. In order to devise a test, 
we search for an empirical situation in which we can demonstrate the 
theorem. The theorem is a statement of a relationship presumed to hold
between two orders or kinds of phenomena. We ask ourselves the ques- 
tion: What are some examples of these two orders in which the condi- 
tions of the theorem can be demonstrated? When a factual situation is 
found in which phenomena of the two orders can be made to agree 
with the conditions of the theorem, we have devised an empirical test. If 
we are not able to find an empirical situation encompassing the condi- 
tions of the theorem, we probably have an untestable theorem. If so, we 
must return to a further deductive elaboration of the hypothesis in order 
to evolve a theorem that can be subjected to empirical examination. 

In the example of glare blindness and night accidents, we developed 
the theorem that in a study of night accidents a disproportionate number 
of drivers would report that they did not see well just preceding the 
accident because of the glare from the headlights of oncoming cars. At 
first thought, it might be considered relatively easy to set up a factual 
test of this theorem. For instance, we could just ask individuals having 
night accidents if the glare of headlights affected their car control just 
preceding their accidents. This study, however, would be difficult to 
execute. We would not be able to observe the accident conditions first
hand and so would have to rely on accident records for much of our data.
Accident records are notorious for their deficiencies. We would have to ...

... in a test are seldom to be gainsaid. But even though we are seldom justi-
fied in challenging the facts, we most certainly are justified at any time
in challenging the logic by which the facts are related to the theorem. 

In a test of the theorem developed for the glare-blindness hypothesis, 
the principal conditions would be: that the test involve accidents occur- 
ring under night-driving conditions, that accident cases be found in 
which the driver had to report that he was or was not glare-blinded from 
the glare of headlights just preceding the accident, and that an adequate 
sample of night accidents be studied. For the results to favor the theorem, 
we would have to find that a disproportionate number of drivers reported 
being glare-blinded just preceding their accident. It should be obvious 
that it would not be an easy task to find empirical situations that would 
satisfy all of these conditions. 

We may be inclined to think that facts favorable to a theorem are the 
only desirable or useful kind. If the test is adequate and the theorem 
pertinent to the hypothesis, negative results are as significant as positive 
results. They may not be as pleasing to an investigator who is greatly 
enamored of his hypothesis, but they will be scientifically important— 
indeed, they may be more important than positive findings. 

Let us refer to a hypothetical illustration. Suppose that in the studying 
of some disease such as infantile paralysis, there had been developed a 
hypothesis that a certain X form of organism was the primary determiner, 
and that millions of dollars had been spent trying to identify the organism. 
Now, suppose we hit upon a theorem the consequences of which, if posi- 
tive, would strongly indicate that this X organism was the primary de- 
terminer. We conduct a test which authorities in the field believe soundly 
represents the theorem, and get negative results. Such negative findings
would have a powerful effect upon any subsequent research on the
disease.


THE PROBLEM OF CONFIRMING A HYPOTHESIS 


In the previous discussion, it was pointed out that we do not directly 
test a hypothesis, but rather we test theorems that evolve from the im- 
plications of a hypothesis. The substantiation of a hypothesis, therefore, 
is not accomplished directly. It is achieved indirectly through its theorems.
It is therefore important that we understand the process of confirming a
hypothesis. 

The Meaning of Confirming a Hypothesis. Logical Considerations.
There are several steps of logical relationships involved between the
facts obtained in an empirical test of a theorem and the hypothesis to be
confirmed. Stated briefly they are as follows: (1) The implications should
follow logically from the assumptions and postulates of the hypothesis.
They should, of course, involve the significant constituents of the problem.
(2) The conditions, elements, relationships, etc., of the implications 
should be contained in the theorem. (3) These same conditions, ele- 
ments, and relationships should function in a representative manner in 
the testing situation. (4) The facts collected in the empirical study 
should be favorable to the theorem. If any one of these relationships is 
not realized, then the evidence cannot be accepted as substantiating the 
hypothesis. To repeat, a hypothesis is not confirmed if the results of the 
test do not correspond to the consequences stated in the theorem, or if 
the test situation does not adequately represent the significant aspects of 
the theorem, or if the theorem does not include the conditions, elements, 
and relationships of the implications, or if the implications do not logically 
issue from the important constituents of the hypothesis. 

These several logical relationships are the essential elements of the 
indirect attack upon the hypothesis. Only through such a series of rela- 
tionships can we transfer the factual evidence of the test to the problem 
of validity of the hypothesis. A failure at any point of the series invali- 
dates the transfer of the facts from the test to the hypothesis. 
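
The all-or-nothing character of this series can be pictured as a simple record of the four links. The sketch below is an illustration only; the class and field names are our own labels for relationships (1) through (4), and the judgment of whether each link holds must of course come from the investigator, not from the program.

```python
from dataclasses import dataclass, fields


@dataclass
class EvidenceChain:
    """The four logical links between the test facts and the hypothesis."""
    implications_follow_from_hypothesis: bool  # link (1)
    theorem_contains_implications: bool        # link (2)
    test_represents_theorem: bool              # link (3)
    facts_favor_theorem: bool                  # link (4)

    def broken_links(self):
        """Names of the links that do not hold."""
        return [f.name for f in fields(self) if not getattr(self, f.name)]

    def supports_hypothesis(self):
        """The facts transfer to the hypothesis only if every link holds."""
        return not self.broken_links()


# A study with favorable facts but an unrepresentative testing situation.
chain = EvidenceChain(True, True, False, True)
print(chain.supports_hypothesis())  # False
print(chain.broken_links())         # ['test_represents_theorem']
```

Recording which link failed is useful because it tells us where to return: to the test, to the theorem, or to the deductive elaboration itself.
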

“Confirming” Compared with “Proving” a Hypothesis. Up to this 
moment we have not found it necessary to define the term confirm. Its
meaning should now be made clear so we will not confuse it with the
more popular term prove.

Confirm means to make firm or firmer, to strengthen, to substantiate,
to make valid. These meanings are relative in nature; they signify a slid-
ing scale to which we can ascribe the characteristic of more-or-less. Thus,
for example, the validity of a test is a more-or-less type of characteristic,
a continuum, the lower end of which reaches to zero and the upper end of
which approaches certainty.

Hypotheses are never proved. The term prove carries the notion of
“certainty,” of “always” or “absolutely true.” That is, if a proposition is
proved, then there is no further question about its truth characteristic;
it is established for all time. In this sense we can never prove a hypothe-
sis. We must remember that a hypothesis is an explanation; that it ...

... we are including within the framework
of the hypothesis. As long as the elements and relationships remain con-
ceptual, so long is there a possibility that the hypothesis is false. Never
should we consider that because a hypothesis is confirmed by verification
of its theorems, it is therefore established for all time. There is nothing
sacred about a hypothesis.

... problem in human behavior invites the use of more than one explana-
tion. There is ample justification for this state of affairs. In com-
plex behavior there are usually many different sets of relevant factual
data, all of which need to be explained. Sometimes these data may be
construed to favor each of several rival hypotheses. There is then need
to select one of these for further study.

The Basis for More than One Hypothesis. The conceptual nature of
hypotheses is what makes possible the devising of more than one hy-
pothesis to explain a given set of events. A particular hypothesis is com-
posed of conceptual elements and relationships that are thought neces-
sary for explaining some given phenomenon. When positive results are
obtained in empirically testing a theorem of this hypothesis, it is said
that the hypothesis is confirmed. A different way of stating this is to say
that the results obtained in the empirical test are to be attributed to the
conceptual elements and relationships of the hypothesis. This latter
interpretation, however, is not necessarily true. There is nothing to pre-
vent us from imagining several hypotheses, containing a variety of con-
ceptual elements and relationships, each hypothesis being framed to ex-
plain the same set of events. Some other hypothesis than the one that
concerns us may explain the findings as well as or better than the one
we are championing. By this is meant that the facts are more compatible
with the postulates and assumptions of another hypothesis, or are better
ordered by another hypothesis than by our own.

One of these other possible hypotheses, and one that is potentially
present in every study, is the hypothesis of chance, attributing the find-
ings to the operation of chance factors. ... but seem to be attributable
to an unknown set of factors that are operating in a hit-or-miss way.
These are chance factors.

The Hypothesis of Chance in the Study of Behavior. In past discussions
we learned that in all scientific studies of behavior chance factors should
... man’s responses ... They are complexly interconnected ... many
diverse ways. We are only ... contribute to our experimental results
without our ... Sometimes they contribute positively, sometimes negatively.
Because we cannot identify them, we are unable to provide controls for
them. Actually, we do not understand the particular factors which pro-
duce chance variations in the results of an experiment. We merely have
suspicions about their nature and function, which we frame in the form
of assumptions.

Knowing that these chance factors may contribute to the results, we
should always examine the hypothesis of chance. Tests should be made 
to determine statistically the significance of the hypothesis. If chance can 
be used to account for the results, we are then not justified in concluding 
that the results confirm our hypothesis, even though they seem positively 
to support it. 

We are fortunate in having statistical procedures for measuring the 
expected contribution of chance factors in our findings. Appropriate
mathematical formulas are available by which we can estimate the extent
to which our results may have occurred from chance determiners. This
knowledge greatly assists us in evaluating any hypothesis we may be
investigating. These statistical procedures can be found in most books
on statistical methods.
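
As one illustration of such a procedure, the sketch below computes an exact binomial tail probability: the probability that chance alone would yield at least as many favorable outcomes as were observed. The choice of this particular test, the function name, and the numbers in the example (8 favorable outcomes in 10 trials, with a chance rate of one-half) are assumptions made for illustration and are not drawn from any actual study.

```python
from math import comb


def upper_tail_probability(successes, trials, chance_rate):
    """Probability of at least `successes` favorable outcomes in `trials`
    independent trials if each is favorable with probability `chance_rate`."""
    return sum(
        comb(trials, k) * chance_rate**k * (1 - chance_rate) ** (trials - k)
        for k in range(successes, trials + 1)
    )


# Hypothetical data: 8 of 10 outcomes favor the theorem; chance rate 0.5.
p_chance = upper_tail_probability(8, 10, 0.5)
print(round(p_chance, 4))  # 0.0547
```

If this probability is not small, chance remains a tenable account of the findings, and we are not justified in treating them as confirming our own hypothesis.
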

Criteria for Evaluating Rival Hypotheses. ...
evaluation of our own or another conceptual hypothesis, chance factors 
should be evaluated as possible determiners of the results. Once the hy- 
pothesis of chance has been ruled out, we can then use the following 
criteria for evaluating the several conceptual hypotheses. 

First, we should mention the stimulating effect any hypothesis has on 
subsequent scientific research. Some hypotheses yield more significant 
implications than do others and thus they supply the bases for more 
empirical investigations of the resulting theorems. Through its implica-
tions, a hypothesis usually gives rise to more problems than it solves,
some of which may be only remotely related to the problem for which the
hypothesis is designed. The criterion of productivity of new ideas is a
very important one.

The number of facts that are brought into a system or order by the 
hypothesis is another criterion. The purpose of a hypothesis being to 
explain consequences, the hypothesis that explains the most consequences 
is to be preferred, other things being equal. This is a function of the ease 
with which inferences and implications can be formed from the hypothe- 
sis. Of course, evidence for and against every hypothesis is continually 
changing, so that the relative explanatory merits and power of any hy-
pothesis can be expected to undergo change as progress is made in per-
forming empirical tests. It is, then, possible for rival hypotheses to change
in their relative explanatory power with reference to the same set of 
events. 

The degree of complexity of ... we make our choice. Comp... assumptions
should also be considered. A complex assumption well sub-
stantiated factually and logically is probably less precarious than a simple
assumption deficient in theoretical and factual support.

A criterion that is in use, but one that should be discouraged, is the
amount of satisfaction that the application of the hypothesis gives to the 
investigator. This refers to the fact that a scientist may experience con- 
siderable satisfaction when his hypothesis enables him to bring together 
several bothersome loose ends that have plagued him over a long period 
of time. As stated elsewhere, facts substantiate hypotheses; feelings are 
not a source of evidence for confirming hypotheses. Choosing from several 
hypotheses the one that is personally satisfying should be done only 
when that hypothesis measures up to the other criteria mentioned above.

Determining When a Hypothesis Is Confirmed. The Problem. As we
have indicated elsewhere, confirming a hypothesis does not require the 
removal of all doubt. We are not required to attain certainty in the pre- 
dictions based on the hypothesis. Rather, we determine in terms of the 
evidence how well the hypothesis is substantiated. If there are many 
different kinds of data pertinent to the hypothesis and favorable to it, 
we conclude that the hypothesis is well confirmed. With few positive 
data, we conclude that the hypothesis is poorly confirmed or unconfirmed.
The concept confirm is to be interpreted as a continuum along which there
are varying degrees of substantiation justified by varying amounts of
favorable and unfavorable evidence. The question: When is a hypothesis
confirmed? is a very difficult one to answer. ...

... are certain of the truth of a hypothesis, for this would be an eternally
long time. Neither should we begin applying it before it has been sub- 
jected to empirical examination. Somewhere in between these two points 
we shall have to establish confidence in our hypothesis and be willing 
to recommend its use to other investigators. 

By using certain statistical procedures for evaluating the empirical 
evidence supporting his hypothesis, the scientist can compute what are
called confidence limits. These limits are mathematical statements of the
probability that chance might have accounted for the results. By choosing
limits within which the contribution of chance is small, the scientist
has confidence that his own hypothesis might be an adequate explanation
of the results.
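
The idea of asking how often chance alone could produce a result can be
sketched in a few lines of Python. The sketch below is not the
confidence-limit computation named above; it substitutes a simple exact
permutation test, and the two sets of scores and the function name
chance_probability are invented for illustration. It counts, among all
ways of relabeling the pooled scores, the fraction that yield a group
difference at least as large as the one observed.

```python
from itertools import combinations


def chance_probability(group_a, group_b):
    """Fraction of all relabelings of the pooled scores whose mean
    difference (b minus a) is at least as large as the observed one."""
    pooled = group_a + group_b
    n_a = len(group_a)
    observed = sum(group_b) / len(group_b) - sum(group_a) / n_a
    at_least_as_large = 0
    total = 0
    for idx in combinations(range(len(pooled)), n_a):
        a = [pooled[i] for i in idx]
        b = [pooled[i] for i in range(len(pooled)) if i not in idx]
        total += 1
        if sum(b) / len(b) - sum(a) / len(a) >= observed:
            at_least_as_large += 1
    return at_least_as_large / total


# Invented scores for two small groups of subjects (hypothetical data).
control = [12, 14, 15]
treated = [20, 22, 25]

print(chance_probability(control, treated))  # 0.05: 1 relabeling in 20
```

With these invented scores only one relabeling in twenty does as well as
the observed grouping, so chance is a poor explanation of the difference.
Where the scores of the two groups overlap, the fraction is much larger.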

The point at which a hypothesis is found acceptable depends,
in part, upon certain personality characteristics of the individual scientist.
If he is willing to be found wrong occasionally he will be willing to
advocate the utilization of the hypothesis with less evidence available
than if he is conservative in nature and wants to make doubly sure before
he advocates its use.

The Explanatory Power of a Hypothesis. The confirmation of a hypothesis
is related to its effectiveness as an explanatory device. This refers
to the number and importance of the facts, conditions, relationships,
etc., that are encompassed by the hypothesis. It refers to the extent to
which the hypothesis enables us to reach toward unknown variables and
incorporate them through theory with the known variables. The effectiveness
of the explanatory function is closely related to the amount of factual
evidence. A good gauge of this amount is the number of empirical tests
that have favored the hypothesis. Another characteristic to be evaluated
is the pertinence of the evidence. This is determined by the importance
of the implications that have been examined and the degree to
which the theorems and the empirical tests represented and evaluated
these implications.

The Predictive Power of a Hypothesis. Another factor conditioning the
degree to which a hypothesis is confirmed is the power and significance
of the predictions that issue from it. If the hypothesis enables us to make

independently of the existence of any other hypothesis, the scientist finds
greatest confidence in that hypothesis that is supported by the largest 
amount of evidence. Having been able to show from empirical tests 
that one hypothesis is superior to all other hypotheses, he will be able to 
apply this one with greater confidence. To demonstrate the superiority of 
any one hypothesis, however, involves a comparison of all of the hy- 
potheses in terms of the various characteristics that we have considered 
in previous sections. This is no mean task, but its accomplishment amply 
repays the investigator in terms of greater insight into the meanings, 
implications, limitations, and validity of the hypothesis that he finally 
selects for further study.

CHAPTER 9


Collecting the Facts 


In the immediately preceding chapters, steps were described for de- 
veloping a problem for empirical study. Starting with rather vague no- 
tions about the problem, we learned how to delimit its boundaries by 
theoretical and factual analyses and to reduce it to concrete and precise 
empirical conditions. It was shown that some advantage accrues from 
stating our problem in the form of a hypothesis and deriving from this 
hypothesis one or more theorems that can be empirically tested. In this 
deductive elaboration of a hypothesis, suggestions are often obtained 
concerning the nature of the empirical testing situation and the procedures
to be used in objectively testing the theorem.

In the present chapter, we shall be concerned with problems associated 
with setting up and conducting the empirical test. An adequate test of 
a theorem requires that the empirical situation contain the conditions 
or factors that are stated or implied in the theorem. Through the testing 
situation, facts are collected that bear upon the variables and relationships
contained in the theorem. The situation must, then, provide for the
presentation of the particular stimuli that will initiate or force the
expression of the relevant behavior variables. Procedures should be used
that will make possible an accurate description of all the variables
operating in the situation. Furthermore, the observations of the scientist
should be accurately recorded, and should be sufficient in number to
provide reliable answers to the various questions raised by the theorem.


DESIGNING THE EMPIRICAL TESTING SITUATION 


Basic Conditions to Be Met. Designing the Test to Fit the Problem.
The organization of the particular procedures, materials, apparatuses,
and subjects selected to form an empirical testing situation is called the 
design of the study. The central thesis in terms of which this selection 
and organization is accomplished is the specific theorem to be tested. 
The theorem expresses the potential relationships among the variables 
that are judged pertinent to a given hypothesis. The testing situation is 
designed to get an empirical expression of these variables under the
specific conditions set by these relationships.

The theorem is the standard against which we evaluate our methods, 
apparatuses, and operations. The phenomena encompassed by a theorem 
usually are restricted to some particular class of objects and relationships 
that are described in terms of their nature, frequency of occurrence, 
probable extent, etc. These characteristics of the theorem serve as checks 
against which to evaluate the testing situation. We can inquire if the 
specific phenomena that we have chosen for a test situation are represen- 
tative of the class of phenomena stipulated in the theorem; if the situation 
that we have selected affords ample opportunity for the expression of 
the phenomena under the conditions of the theorem; and if the relation- 
ships that are expressed among the phenomena in the concrete situation 
that we have selected are the relationships demanded by the theorem. 
Thus we can logically evaluate the test for consistency and pertinency 
before we actually put it into operation. The theorem then determines 
the appropriateness of the techniques, methods, apparatuses, etc., previ- 
ous to their being utilized. 

Let us consider a problem dealing with the functioning of the cerebral 
cortex of the brain. Suppose that we start with the hypothesis that the 
mass of the intact cortical tissue determines the effectiveness of behavior. 
From this we deduce the theorem: Increasing amounts of destruction in 
the frontal lobes of the cortex produce increasing amounts of deteriora- 
tion in intellectual-type functions. Let us see how the design of the study 
will be conditioned by the variables and relationships expressed in the 
theorem. We must select subjects that have well-developed frontal lobes
to be sure that cortical tissues are present. We must obtain individuals
(human or animal) with varying amounts of loss of frontal-lobe cortex. 
This, then, means that if we use human subjects we must depend upon 
head-injury cases and the rather infrequently occurring cases with sur- 
gical lesion. With animal subjects we can produce the lesions through 
surgical means. We must select or devise procedures for measuring vary- 
ing amounts of destruction of the brain tissue. We must study the per- 
formance of the subjects in problems that reflect intellectual behavior. 
We must organize objective situations in which variations in quality or 
effectiveness of behavior can be expressed and measured. We must establish
controls over variables other than the variable of destruction of
frontal cortex which might produce deterioration in the intellectual type
of behavior selected for study. We must select statistical or other procedures
by which we can detect the nature and extent of any relationship
that might occur between deterioration in behavior and cortical destruction.
These are a few ways in which the design of the experiment must
be tailored to the specifications set down in the theorem.

Eliciting Expressions in the Experimental Variable. A problem delimited
for the purpose of scientific study usually consists of one variable
(or at the most a few variables) which logically is directly related to the
conditions or elements of the theorem. As noted earlier, such a variable 
is known as the experimental variable. It is so called because it is through 
this variable that the theorem is experimentally represented in the em- 
pirical testing situation. Although the term experimental variable is usu- 
ally used to refer to the variable that is purposely changed, it should 
be noted that we are interested in the changes in some other variable that 
follow this purposeful variation; that is, we are interested in the func- 
tional relationship between the two variables. 

One of our problems is to set up the experimental variable in such a 
way and in such amounts that the conditions of the theorem will be ade- 
quately represented in the objective situation. The characteristic of rep- 
resentativeness is neither easy to describe nor easy to realize in a given 
experimental study. The theorem will state or imply the characteristics 
that the experimental variable should have. These will include the nature 
of the variable and the amount or frequency of its occurrence. The vari- 
able selected for our test must be the kind demanded by the theorem, 
and we must elicit expressions of the variable in the amounts or fre- 
quencies that will test the relationships pertinent to the theorem. Usu- 
ally, it is necessary to have expressed at least two values or amounts of 
the variable (but preferably more than two) in order to determine precisely
any functional relationship that might be present.
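
Why two values are the bare minimum can be seen in a small Python sketch.
The scores and the helper names below are invented for illustration and
do not come from any experiment. A straight line passes exactly through
any two points, so two values of the experimental variable can show that
a relationship exists but say nothing about its form; a third value can
reveal a departure from linearity.

```python
def line_through(p, q):
    """Slope and intercept of the straight line through two points."""
    (x1, y1), (x2, y2) = p, q
    slope = (y2 - y1) / (x2 - x1)
    return slope, y1 - slope * x1


def max_deviation_from_line(points):
    """Largest vertical distance of any point from the line joining
    the first and last points."""
    slope, intercept = line_through(points[0], points[-1])
    return max(abs(y - (slope * x + intercept)) for x, y in points)


# Invented data: (amount of experimental variable, response measure).
two_values = [(10, 5.0), (40, 20.0)]
three_values = [(10, 5.0), (20, 6.0), (40, 20.0)]

print(max_deviation_from_line(two_values))    # 0.0: a line always fits two points
print(max_deviation_from_line(three_values))  # 4.0: the middle value shows curvature
```

In the invented three-value case the middle score falls well below the
straight line, which suggests an accelerating relationship that the
two-value design could not have detected.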

Let us continue with the theorem that concerns the relationship be- 
tween deterioration in behavior and the amount of loss of cortical tissue. 
Suppose that we select rats as our subjects. We can do this if we have 
assured ourselves that rats have a well-developed frontal cortex and 
are capable of behavior that can be described as intellectual in nature. 

To continue with the development of the testing situation: we must be
able to produce variable amounts of cortical destruction in the frontal
lobes of the rat. There are, of course, several ways this can be done
through the use of cauterizing electric currents. We must also be able
to determine the relative amounts of destruction of cortex in the different
operative rats. Again, there are several methods available by which the
amount of destruction can be estimated after the brain is removed from
the skull. It is, then, possible to produce variable amounts of expression
of the experimental variable (amount of cortex destroyed) and to estimate
the amount of destruction in each of the operated rats.

In our empirical test, we also need to have a means for measuring the 
rat's intellectual behavior. Learning a complex maze would be appropriate.
Maze learning requires the use of several sensory capacities of the
animal and requires him to perform rather complicated behavior in
response to a complex stimulating situation. The maze behavior of the rat
can be quantitatively described so that reliable estimates can be made of 
the amount of any loss that might occur after a cortical lesion. 

Controlling Related Variables. When working with complex global be- 
haviors, we usually discover many variables that are functionally asso- 
ciated with the behavior we wish to study. Having chosen one—or a few 
—of these as the experimental variable, it is imperative that the effects of 
other associated variables be prevented from influencing the behavior 
during the experimental testing period. These other variables might elicit 
changes in the behavior similar to those expected from the experimental 
variable. Obviously, upon completion of a study we shall want to ascribe 
the results to the variations instituted in the experimental variable. We 
cannot do this, however, if one or more other variables have been present 
to which the results can be ascribed. It is possible that these other rele- 
vant variables might produce an effect opposite to that expected of the 
experimental variable. Under such a condition, the effect of the experi- 
mental variable would be canceled out. Our findings would then indicate 
that the experimental variable was not related to the behavior under 
study, whereas in fact it was positively related to this behavior. 

In our experiment on the cortex of the frontal lobes, we would pur- 
posely produce variation in the amount of tissue destroyed but try to 
eliminate the effects of any other variables which might cause a deteriora- 
tion in the performance of the animals. It has frequently been found that 
rats with brain operations are more susceptible than nonoperated rats 
to respiratory infections such as colds. If the colony of rats we were 
studying became afflicted with colds, this variable would tend to produce
a change in behavior similar to that associated with loss of brain
tissue. That is, the colds would be expected to cause a deterioration in 
performance, and the deterioration would be greater for the rats with 
the most severe or most prevalent colds. These, of course, would be the 
operated rats. 
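
The two dangers just described, an uncontrolled variable that mimics the
experimental effect and one that cancels it, can be illustrated with
simple arithmetic. The following Python sketch uses invented effect sizes
and a hypothetical helper named scores; it assumes, purely for
illustration, that the two variables contribute additively to the
behavior measure.

```python
def scores(exp_levels, exp_effect, other_levels, other_effect):
    """Behavior score at each level: contribution of the experimental
    variable plus contribution of one other associated variable."""
    return [exp_effect * x + other_effect * z
            for x, z in zip(exp_levels, other_levels)]


levels = [0, 1, 2, 3]   # amounts of the experimental variable (invented units)
true_effect = 2         # score change per unit of the experimental variable

# Other variable held constant: differences between levels reflect
# only the experimental variable.
controlled = scores(levels, true_effect, [1, 1, 1, 1], -2)

# Other variable rises with the experimental variable and opposes it.
canceled = scores(levels, true_effect, levels, -2)

# Other variable rises with the experimental variable and acts in the
# same direction, as the colds do in the example above.
inflated = scores(levels, true_effect, levels, 2)

print(controlled)  # [-2, 0, 2, 4]
print(canceled)    # [0, 0, 0, 0]
print(inflated)    # [0, 4, 8, 12]
```

Only in the controlled case do the differences between levels equal the
true effect. In the canceled case the data would suggest no relationship
at all, and in the inflated case the effect would appear twice as large
as it is.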

Evaluating Chance Factors. Even after the experimental and relevant 
systematic variables have been brought under control, there will remain 
other variables affecting the behavior under study. These are the unsys- 
tematic variables discussed in Chap. 5. They operate both to facilitate 
and to inhibit the behavior. Although in the long run the expressions of
behavior associated with the experimental variable will not be influenced
in a constant direction, at times these chance factors may operate strongly
enough in one direction to account for the changes observed in the data
of a given experiment. When this occurs we are not justified in ascribing 
the changes to our experimental variable. 

In designing an experiment, we must devise the data-collecting procedures
in a way that will provide a basis for estimating the importance of
chance factors. This is accomplished when all factors but the experimental
variable are forced to operate in an unsystematic manner. In the dis- 
cussion on Control of Psychological Variables we noted some of the 
methods by which this can be done. In later chapters we shall have occa- 
sion to consider the problem again. The designing of a study to make 
possible an evaluation of chance factors is sometimes a very difficult 
matter, and it is not possible here to describe the variety of problems 
that may arise and the procedures available for solving them. It is
important for us to know that the evaluation of chance factors depends
upon the nature of the design of the data-collecting procedures, and that
there are experimental and statistical methods appropriate for most of
the problems that arise.

To illustrate how chance factors may enter into the design of an experiment,
let us consider our study on the function of the frontal lobes. Suppose
we perform operations involving three amounts of destruction, say
10 per cent, 20 per cent, and 40 per cent, and include only two rats in
each of these conditions, or a total of six rats. Experience has shown that
there are large individual differences in the learning performance of rats
on mazes. Furthermore, it has been demonstrated that a given amount of
cortical destruction will not have exactly the same effect on the behavior
of different rats. By chance alone, we might assign the two fastest
learners to the 40 per cent condition and the two slowest learners to the
10 per cent condition. Thus chance distribution of this factor of learning
ability would tend to cancel out any differential effects of the various
amounts of cortical destruction. Increasing the number of rats and careful
random assignment of the animals to the different lesion groups
would militate against chance factors contributing significantly to our
results.
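
The six-rat illustration can be checked by brute force. The Python sketch
below enumerates every way of dividing six rats into three lesion groups
of two and counts how often the two fastest learners fall in the 40 per
cent group while the two slowest fall in the 10 per cent group. The
ranking of rats by learning ability and the function name are assumptions
made for illustration; real animals would not come with known ranks.

```python
from itertools import combinations
from math import comb


def count_assignments(n_rats, per_group):
    """Enumerate all splits of rats (ranked 0 = fastest learner) into
    three equal lesion groups; return (total splits, worst-case splits).
    Assumes n_rats == 3 * per_group."""
    rats = set(range(n_rats))
    fastest = set(range(per_group))
    slowest = set(range(n_rats - per_group, n_rats))
    total = worst = 0
    for g10 in combinations(sorted(rats), per_group):
        rest = rats - set(g10)
        for g20 in combinations(sorted(rest), per_group):
            g40 = rest - set(g20)
            total += 1
            # Worst case: slowest learners get the smallest lesion,
            # fastest learners get the largest.
            if set(g10) == slowest and g40 == fastest:
                worst += 1
    return total, worst


print(count_assignments(6, 2))      # (90, 1)
print(comb(12, 4) * comb(8, 4))     # 34650 splits with four rats per group
```

Under random assignment the complete reversal described above occurs in
1 of 90 equally likely assignments of six rats, and milder imbalances are
far more common. With four rats per group the same extreme arrangement is
1 in 34,650, which shows in miniature why larger groups protect the
results against chance.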

Exploratory, Confirmatory, and Crucial Experiments. Exploratory Experiments.
In the development of a problem for scientific study, questions
often arise concerning details of the procedures that cannot be
answered satisfactorily from the then current knowledge. Information 
can be collected on these questions by conducting pilot or exploratory 
experiments. Questions may arise in connection with the effectiveness 
of a piece of apparatus, the feasibility of a procedure, the difficulty of a 
set of instructions to be given the subjects, etc. It is preferable to conduct
exploratory studies to collect factual information about the apparatus or
procedure or instructions than to put complete trust in a subjective judgment
of their effectiveness. For example, pilot studies are often made in
order to evolve accurate procedures for destroying brain tissues. The
amount of destruction of cortical tissue from the use of an electric cau- 
tery is a function of several variables among which are the size of the 
electrodes, the amount of current, the duration of the current, the pressure
applied to the electrodes, and the amount of fluid surrounding the
area to be destroyed. It is not likely that we could determine through 
reasoning the exact combination of these factors that would give us the 
particular amounts of destruction that we have planned for the study. 
Furthermore, for such a complex problem, even experiments similar to 
our own, previously conducted by others, would seldom provide the exact 
data necessary to set up and calibrate our own apparatus and procedure 
without the aid of exploratory studies. 

Exploratory experiments are not limited to problems of procedure but 
may be conducted on nearly any question that concerns the reduction of 
the theorem to an objective test. Sometimes certain characteristics of the 
individuals to be used as subjects are sought in exploratory studies. Some- 
times items of theory may need preliminary investigation. The purpose 
of exploratory experiments is to perfect all of the rational and empirical 
phases of the test situation so that the objective of devising an adequate 
empirical check of our theorem will not be defeated.

Confirmatory Experiments. As the name implies, this kind of experiment
is conducted to confirm or disconfirm a hypothesis. It is set up to
test a theorem of that hypothesis. As already stated, any empirical study 
of a hypothesis must be established on a series of sound, logically devel- 
oped relations between the hypothesis and its theorem and between the 
theorem and the empirical test. If these relations are logically substan- 
tiated and if positive results are obtained in the experiment, then the 
findings confirm the hypothesis. 

An experimental study may be very efficiently conducted and still not 
either confirm or disconfirm the hypothesis under examination because 
of failure to develop sound deductive relationships between the hypothe- 
sis and the test. At times, however, such a failure may actually prove 
beneficial, because of the corrective effects it will have on any subsequent 
deductive elaboration of the hypothesis. The experiment has then served
an exploratory function. 

Crucial Experiments. This kind of experiment is one that presumably 
settles the fate of the hypothesis being studied. It is a rare occurrence in 
science. The highest probability of its occurring is when the hypothesis is
extremely simple in structure, when the hypothesis can be adequately
represented in a single theorem, and when, in turn, the theorem can be
given a thorough check in an empirical situation involving a few
well-controlled, accurately measurable variables. When the hypothesis
involves highly conceptualized elements, or complexly structured variables,
the probability is almost nil that a single theorem can be devised that
will encompass all of its conditions. If such a theorem were devised,
the probability again is almost nil that it could be represented in a single
objective experimental situation in which the measurement of the variables
and relationships would be unassailable in respect to their reliability
and validity.

It is possible, of course, to formulate hypotheses of simple structure 
and deduce theorems that can be adequately checked in a single experi- 
ment. For example, we could champion the idea that there is a positive 
relation between the amount of pigment in the eye and intelligence. The 
theorem could be stated: All blue-eyed babies develop into dull-witted
children. Obviously, by discovering one intelligent blue-eyed child we
could disconfirm the hypothesis.

A problem simple enough to be reduced to a crucial experiment usually
will not add greatly to our knowledge because it will involve a highly
restricted theorem and a very narrow type of test situation. In our effort
to improve our understanding of such a complex organism as man, we
need not worry ourselves about crucial experiments; they have been
and will continue to be extremely rare in occurrence.

Choosing from among Several Testing Situations. The Problem. Sometimes
more than one testing situation will come to mind in connection
with a given theorem, and we may be compelled to make a choice among
them. It is not presumed that each testing situation will represent equally
well all of the conditions implied in the theorem. Probably the theorem
would have to be modified somewhat to justify the use of all of the
situations. Modification of a theorem, however, does not necessarily weaken
it. Once we are under way in planning a test we may be willing to
modify the theorem in terms of some new development that arises out
of our effort to devise a testing situation. Whether the theorem is
strengthened or weakened will depend upon the soundness with which the
changed theorem represents the conditions of the hypothesis.

Other factors that may strongly influence our selection of a given testing
situation are the relative ease with which the test can be conducted,
the relative cost of the test, our ability to execute the test, the availability
of apparatus or subjects, and the like. The final choice, however, should
rest primarily on whether the test selected will adequately check the
theorem we have decided to examine.

An Example. The following example will illustrate how different testing
situations can be evolved for studying the same hypothesis. A little
reflection will show that the several situations would not all be checking
exactly the same theorem even though logically relevant to the same
hypothesis. Suppose, as a personnel psychologist in a factory, we set up
the following hypothesis: Emotional upsets result in lowered production.
Facts could be collected by conducting an experiment in a psychological
laboratory where subjects would be given work to do and then exposed
to social stimuli that were upsetting in nature. Or an experiment could
be conducted with workers at a factory bench while they were doing their
regular work tasks. Or a special room at the factory could be used for an
experiment where only certain selected types of work were presented. 
Or records could be kept of the workers’ production at the bench and 
examined periodically, and when a worker's output showed a tendency 
to decline he could be given a special interview to determine whether or 
not any emotional difficulties were present. Or workers could be asked 
to keep diaries in which they would describe their current emotional 
difficulties, and then these difficulties could be checked against their 
record of output. Not all of the facts collected through these different 
methods would be comparable. It would not be expected that an answer 
based on the findings of any one study would necessarily be supported 
by those from any other study. 


SOME CHARACTERISTICS OF THE PHYSICAL 
TESTING SITUATION 


In any science, the fundamental problem exists of producing the conditions
necessary to test the theorem under consideration. In psychology,
it is through the production of the appropriate stimulus conditions that 
we elicit the particular elements in behavior that are related to the the- 
orem we have under study. The facts required for testing the theorem are 
facts about response characteristics. 

The Stimulus Situation. Focal and Contextual Stimuli. As we have 
pointed out several times, any given situation contains many different 
stimulating features. For the purpose of creating a particular type of
effect upon a subject, a given stimulus characteristic is singled out for 
manipulation. The subject is directed to pay special attention to this 
aspect of the situation. Because the characteristic is the center of attention,
it is called the focal stimulus. For example, if the subject were required
to discriminate two red lights differing in brightness, the intensity
of the lights is the focal stimulus and would be varied. The wave
lengths and composition of the lights would be held constant. 

Other aspects not part of the center of attention are referred to as
contextual stimuli. They also may be the object of experimental study and
may be varied in much the same way as focal stimuli. Usually the subject
is not made aware of this variation, since the investigator is interested in
the effect of the stimulus when it functions as context. To make the subject
aware of any variation introduced in the context would likely change
the stimulus from contextual to focal. For example, in studying the effect
of masking noises upon pitch discrimination, various levels of noise as
contextual stimuli are introduced coincident with the tones to be
discriminated. If the subject attends to the changes in noise level rather
than to the changes in tone, the object of the investigation is defeated.

Stimulus Categories. The psychologist has been concerned with manipulating
many varieties of sensory stimuli, and in so doing he has devised
several workable schemes for classifying them. One of these—the familiar 
sensory modalities—will serve our purpose. The categories are the familiar 
ones of visual, auditory, cutaneous, olfactory, gustatory, kinesthetic, 
static, and organic. These categories originated from the differences in 
activity of the several special kinds of sensory tissues in the body. 

We need to add symbolic stimuli to the sensory modalities in order to 
achieve a comprehensive classification of the many kinds of stimulation 
used in psychological research. By symbols it is possible to present problems
requiring the use of the higher mental responses. Through these
stimuli the more complex global types of behavior can be elicited. Although
symbolic stimuli are primarily visual and auditory in sensory
character, to place these symbolic stimuli in one or the other of these
sensory categories does not classify the stimuli in terms of their primary
stimulating effect, namely as word or sign meanings. The stimulating
effect of symbols comes from the meanings that are aroused by the visual
or auditory cues and not from the sensory characteristics peculiar to
visual and auditory objects.

Kinds of symbolic stimuli are many and varied. They include word
stimuli, both seen and heard. They include natural objects of all kinds,
such as plants, animals, human beings, and the like. These are capable 
of wide variations in their stimulating effects. Symbolic stimuli literally 
include all stimuli that are capable of eliciting meanings not directly 
reducible to the sensory experiences of the moment. 

The Production of Stimuli. One problem confronting us is the produc- 
tion of a stimulus of the right kind, in the appropriate amount, at the 
most opportune time. In simple laboratory-type discrimination problems,
such as distinguishing between lights of different color or brightness, or
sounds of different pitch or loudness, the nature of the stimulus can be
very accurately controlled. It can be varied with great precision by
means of simple electrical devices inserted in the light- or sound-producing
circuits. Sometimes producing the right kind of stimulus in the laboratory
situation may prove a difficult problem. For example, the generation
of a stimulus tone that can be varied from a half-cycle duration to
several cycles and produced at different frequency and intensity levels
requires an expert knowledge of the physics of sound.

The nature of the stimulating effect is a critical characteristic in most 
testing situations involving symbolic stimuli. The particular meanings
aroused will depend upon the interpretations placed upon the symbols
by the individual subjects. In simple memory and perception experiments,
adequate control of meanings can usually be achieved. In memory
experiments, for instance, careful selection of the material to be learned
enables us to reduce the possible variation in meanings and at the same
time to achieve some control over the difficulty of the task to be pre- 
sented to the subject. 

In social-type situations, where several individuals are interacting, 
very little control may be attained over the meanings aroused. Suppose 
the problem requires the study of a child’s social behavior in response 
to the stimulating influence of several persons of his own age. This can
be done in the play-school situation. Some control over the stimulus 
situation can be achieved by a careful selection of the stimulus children 
to whom the subject is to respond, by the types and number of toys that 
are provided to all of the children, by the instructions given to the 
stimulus-children on what they are to do under certain conditions, and 
so on. But even the most elaborate planning of the stimulus situation 
does not guarantee that the exact stimulus-meanings that are planned 
to be presented to the child will actually occur. The particular meanings
that the child will ascribe to the situations presented are in large measure 
dependent upon his own past experience and his mood of the moment, 
neither of which is brought under control. 

Producing the stimulus at the most opportune time can readily be 
achieved in laboratory experimentation. In the laboratory, the subject 
can be stimulated to respond more or less at the convenience of the 
scientist. In a complex social situation, the scientist must seek out the 
particular occasions when the behavior he wishes to examine occurs 
under the least interference possible from unwanted disturbing variables. 
Suppose that the hypothesis he wishes to investigate is concerned with 
human relations, and that the conditions of the theorem selected for study 
could be found in representative expressions in the labor-management 
type of conference. It would then be necessary to delay the investigation 
until an issue arose for which a conference between labor and management
was called. Conditions of the theorem very likely would not function
in a representative fashion if a conference were especially arranged
to conduct the study when no bona fide difference between labor and
management existed. 

Subjective stimuli are far less easily manipulated than objective 
stimuli, and the timing of their occurrence is usually very difficult. For 
example, in studying the influence of some drug, such as Benzedrine, on a 
given type of learning behavior, it is difficult to select that moment for 
commencing the learning that will coincide with some particular moment 
in the absorption of the drug, such as that moment at which the drug is 
producing its greatest effect on the individual. 

The Quantification of the Stimuli. The problem of producing the stimu- 
lus in certain desired amounts occurs in every experiment. The nature 
and amount of the behavior elicited by a stimulus very frequently is a
function of the amount of the stimulus presented. In sensory experiments
involving simple stimuli such as brightness of lights, pitch of
sounds, intensity of pressures, the amount of the physical stimulus can 
be accurately determined. 
Symbolic stimuli vary in the amount-intensity characteristic, but there
is no simple unambiguous meaning of magnitude that can be applied 
to every form of symbolic stimulus. Frequently, the meaning of amount 
is referred to the number of stimulus units presented, for example, the 
number of words to be memorized in a learning experiment, or the num- 
ber of problems to be solved in a reasoning experiment, or the number 
of trials to be completed on a psychomotor coordination test. Sometimes 
the amount meaning is referred to the level of difficulty of the stimulus 
material or task. Words to be learned, problems to be solved, and psycho- 
motor tasks to be completed can be devised that vary in the ease with 
which the person can perform the required response. It is then possible
to present symbolic stimuli that vary in the level of difficulty with which
they can be understood and responded to by the subject.

In many studies it is necessary that the experimental variable be
expressed at several predetermined levels of amount, and that any one of
these magnitudes be produced at the convenience of the experimenter.
This objective can usually be obtained for the simple forms of stimuli
involved in the study of sensory processes. It is more difficult to achieve
for the stimuli utilized in the studying of learning, memory, and thinking.
It is very difficult to achieve in the study of responses expressive
of emotion and personality.

The production and control of a stimulus variable at any level of
effectiveness is greatly improved by the use of measuring devices. These
devices are not restricted to apparatus of the brass-instrument variety.
Measurement and control of symbolic stimuli can be obtained through
the use of lists of verbal material, test items and problems, pictures and
diagrams, and oral and written instructions.

The Representativeness of Stimuli. One of the difficulties associated
with stimulus manipulation is related to the problem of representativeness.
Testing a theorem requires the functioning of a certain variable or
combination of variables in a representative fashion. This means that
the stimuli eliciting the variables must be controlled in a way that will
achieve the desired representative functioning. The exact meaning of
representativeness is difficult to state except in reference to the specific
variables involved and the relationship between the functioning of these
variables and the conditions demanded by the particular theorem.

The problem of representativeness can be illustrated by a hypothetical
experiment requiring that we obtain the mean performance of high school
graduates on a college entrance examination in geometry. Although the
problem merely demands the mean performance of a specified group on
a certain type of test, it is a very complex one to solve in terms of the 
factor of representativeness. There are three aspects to representativeness 
in this particular example, namely, getting a set of geometry problems 
that are representative of those used in college entrance tests, getting a 
representative sample of high school graduates, and getting the sample 
of subjects to perform in a representative fashion on the geometry exami- 
nation. The first and third aspects are directly concerned with the repre- 
sentative nature of stimuli. The choice of the test and its items forms one 
aspect of the stimulus variable. A second aspect includes the associated 
stimuli by which high school graduates are encouraged to respond
representatively to the test conditions. Failure to achieve representativeness
in any one of the three ways mentioned would prevent a solution to the 
problem. 

Representativeness and Interaction among Variables. Interactive effects 
among variables make it difficult to obtain representative expression. 
We have learned that the testing of a theorem may require the production 
of combinations of stimuli. In psychology, the simultaneous functioning 
of several variables is the rule rather than the exception. When several 
determiners function simultaneously they interact with one another. This
interaction is to be conceived as being itself a variable of tremendous
complexity. It also must be interpreted as a determinant variable, for the
influence that a factor will have on some end result is determined in part
by the presence or absence of some other factors that facilitate or inhibit
its expression. When the picture is complicated by having many factors
operating simultaneously, the problem of interaction becomes undeniably
difficult to unravel.
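
The idea that the influence of one factor depends on the presence or
absence of another can be put in numerical terms. The following is only a
minimal sketch: the two factors, their levels, and every mean response
shown are invented for illustration and do not come from any study.

```python
# Hypothetical mean responses for two factors, each at two levels.
# Keys are (level of factor A, level of factor B); the numbers are invented.
mean_response = {
    ("low", "low"): 10.0,
    ("high", "low"): 12.0,
    ("low", "high"): 11.0,
    ("high", "high"): 21.0,
}


def effect_of_a(b_level):
    """Change in response when A goes from low to high, with B held fixed."""
    return mean_response[("high", b_level)] - mean_response[("low", b_level)]


effect_when_b_low = effect_of_a("low")
effect_when_b_high = effect_of_a("high")

# If the two factors did not interact, these two effects would be equal
# and their difference would be zero.
interaction = effect_when_b_high - effect_when_b_low

print("Effect of A with B low: ", effect_when_b_low)
print("Effect of A with B high:", effect_when_b_high)
print("Interaction contrast:   ", interaction)
```

In this made-up table, factor A appears weak when B is low and strong when
B is high, so no single figure describes "the" influence of A.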

It is apparent that to obtain representative performance in a
variable we must have considerable knowledge about its functional
relationships with other variables. These other variables must be forced to
function in the particular ways and in the particular amounts that will
produce a representative performance in the variable under study. The
various aspects of the stimulating situation must be balanced in such a
way as to achieve this objective.

The problem of interaction is often avoided by setting up a test
situation in which only one variable is allowed to change in value, all other
variables being eliminated or held constant in value. It should be noted
that under these conditions the variable under study is restricted in its
function, and its expressions are not representative of those that would
be obtained if other variables were free to change in value. This problem
is illustrated in the early attempts to study multivariate determiners of a
given response. In these early studies, the logical step was taken of
converting the multivariate problem into the form of the single-variable
experiment. For example, if there were four factors contributing to the
determination of a response, then four experiments were conducted. In each
experiment one of the four factors was allowed to vary while the others
were held constant. It should be apparent that as long as interaction is a
possible variable, the contribution of any factor to a total situation cannot
be discovered by multiple application of the single-variable experiment.
Although there now are several procedures especially designed for
isolating and measuring interaction, they are not universally applicable to
the diverse situations encountered in studying behavior.

The Registration of Stimuli. Registration of the characteristics of the
stimulus is necessary in order that the stimulating conditions can be
kept constant over time and constant among the different individuals
serving as subjects. If an apparatus is used for generating the stimuli,
it is usually a simple problem to calibrate the adjustments or settings of
the dials or other controls so that the same stimuli can be reproduced
whenever desired. Registration of the nature, the intensity, or the
duration of the stimulus is then not difficult.

With symbolic stimuli, the problem of registration may be simple if
the symbols elicit uniformly the same meaning on different occasions and
from different subjects. The registration becomes extremely complicated
when the interactions of several subjects are part of the stimulus
situation, for then the meanings and interpretations of the stimuli are
constantly changing. In this type of situation, a complete record can be
taken of the verbalized stimuli by means of a wire or tape recorder. So
much of the stimulus situation as is transmittable through auditory
symbols can then be studied. This procedure does not make possible a
measurement of all of the characteristics of the situation, but it provides a
great deal of important information about certain aspects of the stimuli,
some of which can be put into numerical form. For example, transcripts
of the deliberations in conferences between representatives of labor and
management have made it possible for investigators to quantify as to
frequency of occurrence certain attitudes displayed by management
toward labor and certain attitudes displayed by labor toward management.

The Response Situation. Response Process or Product. In studying
behavior, we can consider the activity in progress, which involves a study
of the psychological processes occurring during the response, or we can
direct our attention to the changes resulting from the response, in which
case we analyze the results of the activity or the product created by the
activity. Both types of procedure discover fundamental facts by which the
nature of behavior can be unraveled. When only one attack is possible, the
facts revealed by this one usually make possible inferences concerning the
facts that would otherwise be gained from using the other procedure.
That is, we can make inferences about products from knowledge of
processes, and we can make inferences about processes from knowledge
of products.

Response Categories. When the words response, reaction, or activity 
are used, we should not think only of the overt changes observable in 
the movements of an individual. Behavior consists of many forms of 
response; these can be subsumed under one of the following five cate- 
gories: sensory and perceptual experience, higher mental activities, skele- 
tal muscular response, visceral and glandular reactions, and sense-organ 
and nervous-system changes. These categories are not mutually exclu- 
sive. For example, responses of the first two categories are mediated 
through the changes in sense organs, nervous system, and muscles, which 
comprise the other three categories. 

The immediate response to most stimuli is sensory or perceptual ex- 
perience. Sensory experience is distinguished from perceptual experience 
primarily in terms of the direction of attention of the subject. In study- 
ing sensory experience he attends to the subjective impressions elicited 
by the stimuli. His attention is focused upon the sensory experience 
itself. In perception, the subject attends to the stimulus objects and their 
relationships. The problem is to determine how accurately these objects 
and relationships are observed. The attention is on the objective facts 
rather than on the subjective experience. 

Stimuli in the form of symbols arouse conceptual meanings. Through 
these symbols the subject may be required to utilize the higher mental 
responses such as memory, reasoning, imagination, judgment, and the 
like. 

Skeletal muscular responses are executed by the surface muscles of the 
body. These responses are either phasic or postural in nature. The first 
refers to rapid periodic changes in the muscles by which certain end 
adjustments are accomplished, as, for example, movements of the leg,
arm, eyes, vocal cords, etc. Postural responses involve the tonic con- 
traction of skeletal muscles which effects the long-enduring substratum 
of tension upon which the phasic movements are superimposed. 

Visceral and glandular reactions include all of the vital functions of 
the body. Heart action, vascular changes, respiration, digestion, and
endocrine-gland secretion are examples. At times these internal responses 
become important conditioning factors of other forms of response, as in 
the case of emotional reactions. For example, all of us have had the 
experience of being unable to recall a familiar name (mental activity)
because of feelings of anxiety (emotional activity) which result from the 
increased visceral activity engendered by the importance of the general 
situation.

Sense-organ and nervous-system responses refer to the actual tissue
changes that occur when sense organs and nervous tissues are excited to 
activity. The psychologist at times turns physiologist and delves into the 
mechanics of tissue change that underlie both sensory experience, as 
mediated by the sense organs, and various mental activities, as mediated 
by the central nervous system. 

The Detection of Response. We must be able to detect in the subject 
all of the changes that are pertinent to the stimulus conditions we present 
to him. Some behavior components are readily detected, such as move- 
ments in the surface muscles of the body. The expressions of the face, 
the clenched fist, the eye-hand coordinations in typewriting, are all easily 
observed. 

Subjective responses are difficult to detect, especially when the nature
of the response process rather than the result of the response is being
studied. A perennial problem that still baffles the psychologist after
many years of research is the exact nature of the feelings of pleasantness
and unpleasantness. These feelings are familiar reactions. All of us know
when we feel pleasant or unpleasant. Difficulty arises when we try to
detect the behavior changes that underlie these affective experiences.
We seem not to be able to describe accurately what goes on when we
feel pleasant or unpleasant.

Similar difficulties arise when we study higher mental responses like
learning or thinking. We can detect part of the response in terms of the
product resulting from the activity. For example, we can discover some
facts about learning from the length of time required to learn or from
the amount of material recalled after an elapse of time. It is more difficult
to find out exactly what happens in the individual when he learns. One
approach to this problem is to study stimulating situations that vary in
some prescribed way intended to evoke different mental processes. The
differences among the stimulus conditions are constructed so as to
require the subject to use different mental procedures in evolving the
solutions. If the results obtained differ for the several situations, the chances
are high that they have resulted from differences in the mental processes
of the subject. Some information about the subject’s mental processes can
then be inferred from the differences in the nature of the stimulus
conditions presented to him.

The experiments comparing logical and verbatim learning illustrate
this approach. In logical learning, the subject is encouraged to form
associations among the meanings of the material to be learned and
between these meanings and his past experiences. In verbatim learning,
the subject memorizes by merely repeating over and over each meaning,
with little or no effort being made to discover or form relationships among
the meanings. The relative effectiveness of these two procedures is tested
in the subsequent recall of the material learned. Logical learning always 
proves to be markedly superior to verbatim learning in terms of the 
accuracy and amount of the material recalled. This result would be con- 
strued to mean that the mental processes in logical learning, which in- 
volves associations among varied types of experience, issue in more 
effective retention than the mental processes occurring in verbatim learn- 
ing, which involves associations among more restricted types of experi- 
ences. 

Verbal Reporting by the Subject. This procedure is basic to a large
amount of laboratory experimentation. For the psychologist interested in
the nature of the process of response, as in the higher mental functions,
it is indispensable; and for the study of products, it may at times prove
very valuable. Verbal report is used to refer to all communications of
a subject during an experiment that in any way are related to the
execution of the experiment. It is not to be conside[...]
word introspection. Intros[...]
of the experiences arous[...]


ery often rests directly on the 
aple, in an investigation con- 
pon feelings of tiredness, the 
e extent of his feelings of tired- 
d very likely lose interest in it, 
tiredness, he would describe the 


e sound-motion picture record 
the experimenter of what he 


ressure, stomach contractions, or 
the psycho-galvanic response. Most subjective reactions, however, especially
the higher mental responses, must [...]
duced externally to and separately from the devices used to register the
changes. In studying the processes that go on in such an activity as rea- 
soning, the object of study and the registering instrument are one and 
the same, namely, the individual acting as the subject-observer. 

The registration of social behavior is proving a very difficult problem
to solve. The situation to be observed is increased in complexity because
there are several rather than one individual to observe. The interpersonal
responses among the different subjects are more than any one
experimenter can accurately observe. The addition of sound recorders
and motion-picture cameras increases the accuracy. Even with these,
however, it is impossible to capture all of the changes occurring in
behavior.

The Quantification of Response. Quantification of the subject’s reac- 
tions is closely associated with registration of response. Devices which 
record muscular reactions can often be made to register the amount or 
frequency of the changes. Writing levers registering reactions on “end- 
less” paper tapes may provide records of the general form of the re- 
sponses as well as their durations, extensities, and frequencies. 

Mental activities can be quantified through their products. The num- 
ber of problems worked in a given time, or the amount of time required 
to work a given number of problems, provide bases for expressing the 
responses in numerical form. Performance in motor skills, learning of 
mazes, memorizing of verbal materials, solving problems through rea- 
soning, etc., can be quantified in terms of units of work accomplished. 
Accuracy of the performance can also be measured by counting the 
number of mistakes or errors made either in a given amount of time or 
in a given amount of work. 
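
The measures just described reduce to simple ratios. The sketch below is
illustrative only: the function names and the record of 40 problems, 20
minutes, and 6 errors are hypothetical and are not taken from any
experiment.

```python
def work_rate(units_completed, minutes):
    """Units of work accomplished per minute."""
    return units_completed / minutes


def error_rate(errors, units_completed):
    """Errors per unit of work completed."""
    return errors / units_completed


# Hypothetical record: 40 problems worked in 20 minutes, 6 of them wrong.
problems, minutes, errors = 40, 20, 6

rate = work_rate(problems, minutes)               # amount of work per unit time
errors_per_problem = error_rate(errors, problems)  # errors in a given amount of work
errors_per_minute = errors / minutes               # errors in a given amount of time

print(f"{rate:.2f} problems per minute")
print(f"{errors_per_problem:.2f} errors per problem")
print(f"{errors_per_minute:.2f} errors per minute")
```

The same record thus yields a measure of speed and two measures of
accuracy, one referred to the work done and one to the time spent.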

Measurement of attitudes, interests, and personality traits can be ac- 
complished through the use of questionnaires, inventories, tests, or rat- 
ings. The development of procedures for quantifying these complex be- 
haviors has been a slow process, and the best measures now available 
still lack the precision necessary for accurate prediction of individual
performance.


CONDUCTING THE OBSERVATIONS 

The Nature of the Observing Process: Mechanisms Involved in Observ- 
ing. Observation of an event means being aware of the event: that is, the 
observer experiences the event. Environmental changes, both internal and 
external, stimulate the sense organs, which in turn elicit changes in the 
sensory nerves. When these sensory-nerve changes reach the brain, we
have an experience of the event. These experiences are given names and
their familiar aspects are carefully symbolized.

Observation of an event is not merely the arousal of a sense organ
and the consequent elicitation of activity in a sensory nerve or the brain.
It also includes an attentive adjustment of the individual to the event
and the rearousal of past experiences in the form of meanings associated
with the event. Recognition of some aspect or feature of the event is
conditioned upon a rearousal of past experience, as is also the naming or
symbolizing of the experience.

The Active Nature of Observing. Scientific observation of phenomena is
not a passive but an active response. As scientists we do not sit in a
grandstand and watch a parade of unit experiences go by, like observing
the horses at a race track. On the contrary, we question nature; we go
[...] then note what happens. The problem that the
scientist selects for study and the testing situation he establishes for
attacking the problem in a large measure determine the particular vari-
amined. Nature, then, is not only 
or forced to take place. 

vation is not an idle collecting of 
erized by mental sets, By the nature 
estigate, we are set to become aware
scientist’s observational behavior, as he is the major source of subjectivity
in the scientific method. 

Safeguards in Attitude. Let us begin with the factor of attitude. The 
scientist must disclaim infallibility. This admission of the possibility of 
error has two salutary effects; namely, it keeps him on the alert during 
the making of his observations, and it makes him critical of his interpre- 
tations of these observations. The scientist must acknowledge the need 
for the elimination of bias and take positive steps to control it. There is
much truth in the statement that a “pet theory makes for prejudiced
observation.” To counteract his own preconceptions, the scientist should
deliberately search for the exceptions. His attitude is not one of proving his
notions but one of testing them.

Adequate Training in Observational Techniques. Accurate collecting of 
the facts demands a thorough background in the field of the problem to 
be investigated and acquaintance with the observational techniques that 
are applicable in this field. Accurate observation results from practice 
in observing the kind of phenomena to be studied. A mere intention to 
observe accurately does not guarantee accuracy. If, in the field selected 
for research, special procedures have been developed through which the 
phenomena can be made to occur under conditions that will enhance 
accuracy of observation, then these procedures should be learned and 
used by the scientist. Adequate training will prepare the scientist, not 
only to know what he is looking for, but to know where and when to 
look. Knowing where to expect an event and approximately when it will 
occur goes far toward guaranteeing that when the event actually takes
place it will not escape observation.

Mechanical Supplements to Observation. Frequently, the sensory
mechanisms of observation can be aided by mechanical means. If the
scientist is required to observe a visual object, he has many optical
devices to aid him. Depending on the problem, use can be made of
telescopes, magnifying glasses, microscopes, cameras, polygraphs, recording
oscillographs, and similar gadgets. In the modality of hearing, there are
sound detectors, amplifying tubes, recording machines, and the like.

The Temporal Course of the Observations. The Necessity for Constant
Conditions of Observation. Nearly all research studies occupy an
appreciable period of time, the observations extending over days or months or
even years. If the observations made at different times are to be
comparable, it is necessary that the conditions of observations be held constant.
In psychology, this fact is of prime importance because of the possibility
of change occurring in the powers of observation of the experimenter, of
the subject, or of both, during the course of an investigation.

Temporal Variations in Subjects. When persons are to be used as
subjects, as in experimentation on sensation, perception, imagination,
thinking, and similar mental activities, it is often necessary that they develop
and maintain a particular set or point of view toward the stimuli presented.
They must have sufficient background experience so that they
can correctly understand the particular psychological concepts
utilized. Much controversy has occurred in experimental work [...]tion
because descriptive terms were not well understood, with the result
that during the course of the experiment, variation occurred in the
subject's attitude or set toward the tasks given him.

The stimulus-error experiment supplies an illustration of this problem.
Many studies have been conducted in an attempt to learn the nature of
xperiments included reactions > 
at more precise control over n 
d in vision than in other sense 
esented with a visual stimulus a 
it. Sometimes the subject describe 
t, and sometimes he described — 
e. The stimulus-error was committee 
he stimulus object and described ae m 
g the experience aroused by the stimu- 
between the two points of reference in 
his descriptions produced uncontrollable variations in the course of pA 
t impossible to interpret the results 
Constancy of certain charact e subject is necessary even 
is not acting as an er. The tasks a subject is given 1 
do are to be carried out in the particular Way specified by the experi- 
hese procedures produces a ee 
parable results. For example, if, os 
a of arithmetic problems, the subject 
is told to work every problem in order, that procedure should be rigorously
held to by every subject throughout the testing period. If a
i oint in the test and skips some pre 
he feels are easier, his performance is = 
Pursuit task the subject may be to 


8 instead GE 
: oses to work for accuracy instea 
erformance is invalidated, 


eristics in th
: is more diffic > assured that
operating fully thy difficult to be assur 


[...] true
when the subject is called upon to perform a routine task over an extended
time, such as adding 1-place numbers for a 3-hour period. Sometimes,
having the subject report his feelings about the task or about his
achievement in the work assigned him may provide clues concerning his
attitude. However, such information is not always reliable.

In the case of animal subjects, it is often necessary to give preliminary 
training in the testing situation to familiarize the animal with its general 
physical features. In experiments on learning, the animal may be required
to manipulate objects, such as opening gates, pulling strings, or pushing
levers. The animal's ability to operate such devices is usually not the
problem being studied. It may be necessary for him to operate a device,
however, in order that his ability to learn some other task can be
observed. For example, one-way gates are used in mazes to prevent
retracing. The rat must learn to open the gates, but the problem under study
concerns the rat's ability to learn the pattern of the maze. The rat must
be taught to operate the gates before practice on the maze is begun, in
order that the learning of the maze pattern will not be affected in a
systematic way by a delayed learning of the gate-operating procedures.
Temporal Variations in Apparatus. We have already noted the useful- 
ness of apparatus in presenting and registering stimuli, in controlling and 
registering responses, and in supplementing the observational powers of 
the experimenter. Continued use of a piece of apparatus requires periodic 
maintenance and calibration. Even the best-constructed apparatus will 
show variations in precision with continued operation. The performance 
of different subjects cannot be considered comparable if the precision
of the machine registering the stimuli or responses varies during their
performances.

Before an experiment is actually begun, trial runs can be made to
determine the likelihood of instrumental variation and to set standards
of precision to be maintained throughout the experiment. Periodic
calibration checks can then be instituted and the apparatus maintained at
an approximately constant level of precision during the testing.

The Number of Observations. Variability of Behavior. The number of
observations required in a scientific study is a function of the variability
of the behavior being examined. We have emphasized several times the
intricate nature of the orders underlying behavior. It should be apparent
that an understanding of any functional relationship, even the most
simple, develops from many observations of it. A single observation,
although true as an existential fact, adds little to our knowledge of a
phenomenon unless we relate it to other facts about that phenomenon. An
observation in isolation has little meaning. It takes on meaning as it is
related to other observations, both those that agree with it and those
that are in disagreement.

Consider the very commonly occurring fact of variation in the
performance of an individual reacting several times under a given set of
conditions. A runner in a track meet does the 100-yard dash in 9.6 seconds.
Under similar conditions last week his time was 10.1, and for the week 
before it was 9.8. A tackle on the varsity football squad gains the plaudits 
of the crowd in one game by applying a key block on the opposing 
quarterback who appeared to be on his way to making the tying touch- 
down. The next Saturday he misses his tackle under much the same con- 
ditions, and the resulting touchdown becomes the margin of victory of the 
opponents. If we were given the task of characterizing this tackle’s ability 
as a prospect for all-American honors, we would want to observe his 
playing over many games of the season so there would be opportunity 
for us to learn what he would be expected to do most of the time. 

This variation in performance is found in all animal and human sub- 
jects studied in psychological experiments. Their performance varies 
from one sitting to another. If only one performance is obtained, it is 
not possible to learn how closely this one is equivalent to the average 
performance, or to learn if it is above or below the expected average. 
Only by observing many performances can we be assured of having 
enough data for computing a value that is near to or equivalent to 
what would be expected “on the average.”
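
The point can be made concrete with the three dash times of the runner
mentioned earlier (9.6, 10.1, and 9.8 seconds). The sketch below assumes
only Python's standard `statistics` module; using the mean and the sample
standard deviation as summaries is an illustrative choice, not a procedure
prescribed here.

```python
from statistics import mean, stdev

# The three 100-yard-dash times given for the runner, in seconds.
times = [9.6, 10.1, 9.8]

average = mean(times)
spread = stdev(times)  # sample standard deviation of the three times

print(f"mean = {average:.2f} s, standard deviation = {spread:.2f} s")

# How far each single observation lies from the average of all three.
for t in times:
    print(f"{t:.1f} s differs from the mean by {t - average:+.2f} s")
```

No one of the three times equals the average, and with a single time in
hand there would be no way of telling whether it lay above or below it.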


A similar problem of variability in performance exists when our task 
f a group, or when it is to characterize 


sions. We need to learn what its ch 
specific conditions of our testing si
interacting we need to learn how they interact “on the average” under
the conditions of our experimental situation. 

In our previous example of the football player, we would need to
observe him many times to learn how he would perform “on the average.”
To determine if he were deserving of all-American mention we should
observe him performing the characteristic tasks of a tackle, such as
blocking, tackling, and running interference. We should watch him do
all of these tasks many times. If we see him perform only part of the
tackle responses, or observe him only once or twice performing any
tackle response, we really lack the information necessary to determine if
we have observed a representative performance. It should be obvious that
to learn how our tackle would behave “on the average” in the various
tackle responses we must observe him on many occasions.

Getting the variables expressed in representative ways and making
many observations of these representative performances give us stability
in our statistics. In psychological research we want to achieve stability
in our measurements. We want measures that will “stay put” any time
they are used. A measure of a group's performance, such as the mean, is
not a very useful statistic if it varies widely in value each time the group
is measured.
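
The gain in stability that comes from adding observations can be
illustrated with a small simulation. The sketch below is only an
illustration under assumed values: a hypothetical performer whose scores
vary by chance around an average of 50 with a standard deviation of 10,
the chance variation being taken as normal. None of these figures comes
from the discussion above.

```python
import random
import statistics

def spread_of_means(n_observations, n_repeats=500, seed=1):
    """Standard deviation of the mean across repeated series of observations."""
    rng = random.Random(seed)
    means = []
    for _ in range(n_repeats):
        # One series of observations of the same hypothetical performer.
        scores = [rng.gauss(50.0, 10.0) for _ in range(n_observations)]
        means.append(statistics.mean(scores))
    return statistics.stdev(means)

few = spread_of_means(4)    # mean based on 4 observations
many = spread_of_means(64)  # mean based on 64 observations
print(f"spread of the mean with 4 observations:  {few:.2f}")
print(f"spread of the mean with 64 observations: {many:.2f}")
```

Under these assumptions the mean based on 64 observations fluctuates
far less from one series to the next than the mean based on 4, which is
what is meant by a statistic that will “stay put.”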


Statistical Compared with Practical Significance. The number of
observations bears an important relation to the concept of “significance.”
If a scientific finding cannot be accounted for by the operation of chance 
factors it is declared “statistically significant.” Sometimes a finding is of 
little “practical significance” even though it is statistically significant. The 
greatest contribution is made when the finding is both statistically and 
practically significant. 

Consideration of an example will make these two concepts clearer. 
Suppose that in a program of educational guidance it is important to 
know whether the scholastic ability of upper classmen is higher than that 
of lower classmen. We administer a scholastic ability test to 1,000
freshmen and sophomores and to 1,000 juniors and seniors to learn if there is a
significant difference between their mean performances. The difference 
is found to favor the upper classes by 5 points, the mean of the lower 
classes being 175 and the mean of the upper classes being 180. Is this 
a real difference? If we mean by “real” that the difference is large 
enough to be useful in deciding what kinds of courses should be placed
in the several college years, then the answer is no. A difference this
small has no practical significance. If by “real” we mean that the
difference cannot be accounted for by chance factors and therefore that the
upper classes are scholastically superior to the lower classes, then the
answer is likely to be yes. If the groups are fairly homogeneous, then the
large number of cases tested would result in stable means and the
difference probably would be found to be statistically significant.
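
The contrast between the two kinds of significance can be worked out
numerically. The sketch below uses the two means and group sizes of the
example, but the standard deviation of 30 points is purely an assumed
figure, since the example does not state one. It computes the ratio of
the difference to its standard error, the corresponding chance
probability under the normal curve, and the size of the difference in
standard deviation units.

```python
import math

def two_sample_z(mean_a, mean_b, sd, n_a, n_b):
    """Ratio of the difference between two independent means to its standard error."""
    standard_error = sd * math.sqrt(1.0 / n_a + 1.0 / n_b)
    return (mean_b - mean_a) / standard_error

def two_sided_p(z):
    """Two-sided probability of a ratio at least this large under the normal curve."""
    return math.erfc(abs(z) / math.sqrt(2.0))

ASSUMED_SD = 30.0  # hypothetical value; the example gives only the means

z = two_sample_z(175.0, 180.0, ASSUMED_SD, 1000, 1000)
p = two_sided_p(z)
effect_size = (180.0 - 175.0) / ASSUMED_SD  # difference in SD units

print(f"z = {z:.2f}, p = {p:.5f}, difference = {effect_size:.2f} SD")
```

With these assumed values the ratio exceeds 3.7, so chance is a very
unlikely explanation, yet the difference amounts to only about one-sixth
of a standard deviation. The finding would be statistically significant
and still of little practical use.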

Designing the Study in Order to Increase the Number of Observations. 
We should design an experiment so that observations can be added to 
those completed if the latter are insufficient in number to give us stable 
measures of our variables. We are unable to predict the number of ob- 
servations we shall need to get significant results. We may design an 
experiment for 30 cases and find at the end that our results are not 
statistically reliable. If an additional 20 or 80 subjects are likely to pro- 
duce significant results, we may want to continue the experiment. In such 
a situation it is necessary that the subjects added later be comparable 
in terms of our variables with those used in the first part of the study. 
If this condition is not realized, then the expression of the variables is 
not representative throughout the entire investigation. We are then not 
justified in pooling the two sets of measures in order to gain more stable 
statistics.

Suppose there are two groups of subjects available to us for an
experiment, 35 eighth-graders and 35 ninth-graders. We use the
eighth-graders because it is more convenient. After collecting the data we find
that we need additional subjects, and now use the ninth-graders. Suppose
the interest and ability of the two groups are different and are related
to our problem. The differences between the two groups introduce
systematic differences into the two parts of the experiment. A better design
would have been to randomly select half the subjects from the eighth
grade and half from the ninth grade at the beginning. The same
procedure could then be followed when additional subjects were needed.
The characteristics of the subjects would then remain comparable
throughout the entire study.
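
The better design just described can be sketched as a simple selection
routine. The subject labels, the phase sizes of 30 and 20 cases, and the
function name below are illustrative assumptions only; the point is that
each phase draws at random, and in equal numbers, from both grades.

```python
import random

def draw_balanced(pool_a, pool_b, n_per_pool, rng):
    """Randomly draw n_per_pool subjects from each pool, removing them from the pools."""
    chosen = []
    for pool in (pool_a, pool_b):
        picked = rng.sample(pool, n_per_pool)
        for subject in picked:
            pool.remove(subject)  # a subject can be used only once
        chosen.extend(picked)
    return chosen

rng = random.Random(7)
eighth = [f"8th-{i:02d}" for i in range(1, 36)]  # 35 eighth-graders
ninth = [f"9th-{i:02d}" for i in range(1, 36)]   # 35 ninth-graders

first_phase = draw_balanced(eighth, ninth, 15, rng)  # initial 30 cases
added_later = draw_balanced(eighth, ninth, 10, rng)  # 20 additional cases
```

Because both phases have the same composition by grade, the two sets of
measures can be pooled without introducing a systematic difference
between the first and second parts of the study.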


Recording the Observations. The Need of Records. Analysis and
interpretation of the variables in an experiment become possible only when
records are made of their expressions. An experience lasts but a brief
moment. Most performances occupy but a short period of time. Without
some permanent representation of these transitory events, which can be
leisurely examined following their occurrence, we can make little progress
toward understanding the variables.

Records Should Be Comprehensive. It should be obvious that the 
accuracy and thoroughness of the analyses will be conditioned by the 
comprehensiveness of the records. The nature of the problem, of course,
will also determine the kind and amount of the records. The records
should be complete enough to afford an accurate appraisal of the
variables related to the theorem under test. Inasmuch as we usually are
unable to foretell the exact kind and amount of information we shall
need, it is better to err on the side of recording more data than are
needed than to find after the collection of facts is completed that the
data are insufficient for our purposes.

The Accuracy of Records Varies with the Degree of Conceptualization. 
The accuracy of the records depends upon the degree to which the sym- 
bols used correspond with the empirical changes being represented. A 
natural event perceived through one of the exteroceptors arouses a less 
variable experience than an event that stimulates us through an internal 
sensory mechanism. When symbols such as words are assigned to experi- 
ences, then the less the variability of the experience, the more uniform 
will be the understanding of the meanings assigned the words, and 
therefore the closer the correspondence between the words and the 
experiences of the natural event. As we proceed from this sensory level 
of fact to levels involving abstraction and conceptualization, where 
rational processes enter into the determination of the fact, the agreement 
obtained between different observers decreases, resulting in a lower de- 
gree of correspondence between symbol and fact. In general, then, the 
highest correspondence between symbol and fact is attained in what 
we call sensory meanings, with the correspondence becoming less as 
more conceptualized meanings are introduced into the description. 

Incorrect symbolization results in incorrect meanings. Later analyses 
and interpretations made of the recorded data are then in error. In turn, 
this means inaccuracy in our conclusions and generalizations.

Limitations of Apparatus Recording. A distorted and incomplete
picture of the variables in an experiment may result if instrumental recording
is not supplemented by the observations of the scientist. A testing
situation is never so perfected that the scientist can absent himself until the
facts are collected. No apparatus has yet been invented that will record
all of the changes in the stimuli and responses occurring during a
psychological experiment. Certainly, some characteristics of the
stimulus-response relationships can be accurately registered by writing levers,
photographic film, and other means. In simple experimental situations,
such recording has reached a high level of precision. When complex
combinations of stimuli are involved and the subject's reactions require
coordinated changes in many muscle groups, the recorded
picture may even be a distortion of the true performance.
It requires the observations of the scientist to detect and correct this
distortion. When global behavior is under investigation the characteristics
of response in the form of attitudinal and motivational changes are not
even directly detectable by instrumental recording.

The argument being advanced here is that we cannot dispense with the
observations of the scientist. No instrument has been invented that has
the observational powers of the human being. The scientist, therefore,
should not conclude that a record of his personal observations is
unnecessary because he has devised a fancy brass recorder. Apparatus recording
is highly desirable, but one of the very important reasons for justifying
such recording is that it gives the scientist greater freedom in directing
his observations upon those features that are not reducible to
instrumental registration.

An example of possible overdependence upon apparatus recording is 
found in some of the time-and-motion studies of worker performance. 
Very accurate registration has been made of the manual responses of 
bench workers, such as assemblers. Stereoscopic motion cameras made it 
possible to trace accurately the motion of the arms and hands in the per- 
formance of very complicated kinds of tasks. Registration on the film 
of a time record made it possible to estimate the time of each of the 
different motions. With such accurate depicting of the time-and-space 
characteristics of the minute changes in the arms and hands during a 
task, it was possible rationally to combine part responses in such a way 
as to produce, conceptually speaking, a “best” way of performing the 
assembly. In their enthusiasm for analyzing the response records of the
workers, the investigators lost sight of the psychological principle of
individual differences. According to this principle it is psychologically
improbable that one “best” way can be found that will allow every worker
to produce at his individual best. This ignoring of the principle of
individual differences would probably not have occurred if more time of the
investigator had been spent in noting to what degree and for what reasons
one worker performed a given task in a different way from that of his
fellows.


Keeping a Daily Log. ... the conduct of an experiment ... and
interpretations of the data. The scientist should not trust to memory
all of the little details ... be recalled later during the analysis and
interpretation of the experimental results. ... on the card, sheet,
film, or tape on which the performance itself is registered. Another
good practice is to keep a daily log or notebook. This record should
contain any fact arising during the collecting of the data that at a later time
might prove useful in the interpreting of the performance records of the
subjects.

CHAPTER 10


The Organization, Analysis, and Interpretation 
of Facts 


The general purpose of the organization and analysis of data is to derive
meanings by which the results of an investigation can be correctly
interpreted. The data as collected in an experiment, survey, interview, or
other type of study are often not organized or arranged with particular
reference to the problem in hand. The initial organization is frequently
dictated by convenience. Furthermore, the meanings inherent in the data
are not obtainable from a cursory examination. Many logical and
statistical analyses are required to extricate these meanings. What then is needed
is a reorganization of the data in terms of the purpose for which the
study is being made and the execution of such analyses as will reveal
all meanings relevant to this purpose.

Organization, analysis, and interpretation are not postponed until all
the facts are collected but are begun as soon as the first facts are in.
Few studies are so perfect in design that no questions or problems
arise during the period of collecting the facts. Frequently, the knowledge
of how the variables are interacting, which is gained in the early phases
of a study, provides the basis for adjustments in procedures that greatly
increase the precision achievable in the later phases of the study.


GENERAL OBJECTIVES OF ORGANIZATION AND ANALYSIS 


Working over the facts that are collected serves many purposes, not
all of which may have been in mind when the study was begun. We try
primarily to learn if the facts confirm the hypothesis around which the
study was evolved. Regardless of whether the findings are favorable or
unfavorable to the hypothesis, we must determine if the results are
reliable. If they are reliable, we then proceed to determine the explanatory
value of the findings through the process of generalization. While going
through these three phases of our examination, unanticipated events may
happen. We may encounter problems that lead us off into new areas of
research. We may evolve ideas for changing our
hypothesis. We may get “bogged down” trying to determine the meaning
of some unique relationship that we discover in the data. Because of the
uncertainties and surprises involved, extracting meanings from our
experimental results is one of the most fascinating and exciting phases of
the scientific method.

Relating the Facts to the Hypothesis. The Use of Simple Classification.
Usually the first step in organizing the data is to classify them in respect
to some characteristic that is significant for the theorem under study.
The theorem has stated or implied certain consequences which
presumably would follow from the conditions of the experiment. Classification
arranges the data to determine if the consequences obtained are the
same as those stated or implied at the beginning. If they are the same,
the evidence is accepted as confirming the hypothesis.

Suppose we conduct a survey in which we select the subjects from sev- 
eral nationality groups. This study might be concerned with the effect 
of nationality background on opinion toward the use of war. Our hy- 
pothesis might be that the degree of tolerance for war as a national in- 
strument is a function of nationality. We decide to investigate the theorem 
that Nordic, Alpine, and Mediterranean nations differ significantly in 
their opinions concerning the conditions under which a nation should 
resort to war. We construct a questionnaire for sampling opinions of these 
three national groups with respect to their tolerance toward the use of 
war. We collect the opinions from our respondents as convenience and 
feasibility dictate, and the responses are arranged simply in the order in
which they are obtained. The first variable to serve as a basis for
classification and one that is basic to the problem is that of nationality. The
records are then rearranged in terms of the nationality of the respondents.
With the responses so classified it now becomes possible to apply
descriptive techniques, such as computing an average tolerance score for
each respondent based on the number of his responses favoring or
disfavoring the use of war. An average tolerance score can then be
computed for each of the nationality groups. These averages are new
meanings being extracted from the data. They add more precise information
about the variables basic to the hypothesis.

The Use of Multiple Classification. In most experiments there will be
several variables functioning. It is, then, possible to classify the data in
several different ways. These classifications are not equally significant
for the theorem, but contribute important information about the variables
on which the test of the theorem is based.

In the example cited we could collect information on the age and sex
of the respondents. Then we could classify the subjects on each of these
two variables independently of the other variable and independently of
the variable of nationality. We could use simple classification to do this.
On the other hand, we could use multiple classification procedures. We 
could first classify according to nationality, then within nationality classify 
according to sex, and then within sex classify according to age. Such 
multiple classification makes possible many new meanings and greatly 
facilitates our interpretation of the results. For example, having computed
a tolerance score for each age group within each sex within each nation- 
ality, we could proceed to evaluate the contribution of these three vari- 
ables of nationality, sex, and age. We could compare age groups within 
sexes within nationalities, or age groups between sexes within nation- 
alities, or age groups between sexes between nationalities. We could com- 
pare sexes within nationalities, or sexes between nationalities. We could 
compare nationalities within sexes or nationalities within age groups. Of 
course, these comparisons are not equally meaningful or relevant to the 
hypothesis. It should be apparent, however, that some of these
comparisons would shed considerable light on the nature of the variables
underlying the facts that we have collected.
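
The difference between simple and multiple classification can be made
concrete with a short sketch. The records, the two age classes, and the
tolerance scores below are invented solely for illustration, and the
function name is our own. The same routine produces a simple
classification when one variable is used and a multiple classification
when several are used.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical records: (nationality group, sex, age group, tolerance score).
records = [
    ("Nordic", "M", "under 40", 6), ("Nordic", "M", "40 and over", 4),
    ("Nordic", "F", "under 40", 5), ("Nordic", "F", "40 and over", 3),
    ("Alpine", "M", "under 40", 7), ("Alpine", "F", "under 40", 5),
    ("Alpine", "M", "40 and over", 6), ("Alpine", "F", "40 and over", 4),
    ("Mediterranean", "M", "under 40", 8), ("Mediterranean", "F", "under 40", 6),
    ("Mediterranean", "M", "40 and over", 5), ("Mediterranean", "F", "40 and over", 5),
]

def classify(rows, key_positions):
    """Group tolerance scores by the chosen variables; return the mean of each class."""
    groups = defaultdict(list)
    for row in rows:
        key = tuple(row[i] for i in key_positions)
        groups[key].append(row[3])
    return {key: mean(scores) for key, scores in groups.items()}

by_nationality = classify(records, [0])        # simple classification
by_nat_sex = classify(records, [0, 1])         # sex within nationality
by_nat_sex_age = classify(records, [0, 1, 2])  # age within sex within nationality

for key, value in sorted(by_nationality.items()):
    print(key, value)
```

Each added variable multiplies the number of classes and therefore the
number of comparisons that can be made, which is why only some of them
will bear on the hypothesis.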


... usually is to be considered busy work. Occasionally in such byplay
... upon a significant finding which was not in any preconceived way
related to his problem. Such findings, however, are indeed rare.

... evaluated in our attempt to ... variation is a function of three
factors, namely, the generality of the hypothesis and the theorem
developed from ... testing situation, and the psychological nature ...
variability in performance ... Regardless of how narrow the hypothesis
or how high a precision is attained in the test situation, there will
continue to be variation in the performance of subjects
because of the inherent variability of psychological abilities. This
variability must be evaluated in our effort to confirm the hypothesis. We 
shall return to this problem in a later section. 

The Consequences of Negative Results. If the findings do not verify 
our theorem, the hypothesis is not confirmed. When this occurs, either 
one of two alternative actions may be taken. The hypothesis may be 
adjusted to conform to the new evidence and further tests then made. 
Or the hypothesis may be abandoned and others sought. The first pro- 
cedure is followed when other lines of evidence support the hypothesis or 
when the hypothesis seems to be working successfully in terms of its 
predictive effectiveness. The second alternative is followed when the evi- 
dence is overwhelmingly against the hypothesis, and especially when 
other hypotheses of more promise remain to be evaluated. 

Evidence from a test may be sound and still not increase our knowl- 
edge about the hypothesis. This is because the theorem tested is not 
pertinent. It may be found that the results favor another hypothesis 
more than the one that is being examined. This is not likely to occur 
when the steps involved in the deductive elaboration of the hypothesis 
have been correctly executed. 

Determining the Reliability of the Meanings. Some Logical Aspects of 
Reliability. The confirmation or disconfirmation of a hypothesis is based 
on the pertinence of the evidence, the amount of the evidence, and the 
relation which the evidence bears to alternative hypotheses. 

Verification of any hypothesis can only be approximate, regardless of 
the relevance or amount of the evidence. The results of a test situation 
are examined in terms of the number of facts favoring the hypothesis and 
the degree of favorableness of these facts. The results of the test are
compatible with the hypothesis only in terms of degree and not in any
all-or-none sense. Variation in evidence means variation in degree of
compatibility. The acceptance or rejection of a hypothesis is then never final
in any absolute sense.

The favorableness of the facts for a given hypothesis is a relative
matter. It depends, in part, upon the extent to which the results also
favor one or more other hypotheses. The facts are then examined in
reference to alternative hypotheses. In the logical development of our
hypothesis we attempt to devise theorems and test situations that issue
directly from the hypothesis. We also attempt to model the theorems and
test situations so they will be either unrelated or opposed to rival
hypotheses. This objective is easy to state but difficult to accomplish.
Therefore, in the analysis of the results we must determine if and to what
degree the findings are related to rival hypotheses.

The significance of the results for any hypothesis also rests upon the
degree to which other factors than those purposely varied by the
gator have operated in the test situation. The presence of these factors 
may vitiate the relationship that the results bear to the favored hypothesis. 
They may also be very pertinent to competing hypotheses. If in an experi- 
ment the theorem required the determination of the correlation between 
reading ability and college success and we failed to evaluate such factors 
as high school preparation, scholastic aptitude, and study habits, we 
would not obtain a reliable measure of the relationship that we sought. 
One of the tasks in organizing and analyzing the results is to determine 
the method of operation of any vitiating factors and to remove their 
effects from the results during the process of relating the results to the 
hypothesis under study. There are statistical procedures by which this 
can be done, providing the study was originally designed to encompass 
these factors. The methods of partial and multiple correlation and of 
analysis of variance and covariance are powerful tools for accomplish- 
ing this purpose. 
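
As one illustration of such a procedure, the first-order partial
correlation removes the influence of a third variable from the
correlation between two others. The sketch below applies the standard
formula to the reading-ability example. The three correlation values are
assumed for illustration and are not findings from any study.

```python
import math

def partial_correlation(r_xy, r_xz, r_yz):
    """First-order partial correlation of x and y with z held constant."""
    return (r_xy - r_xz * r_yz) / math.sqrt((1.0 - r_xz ** 2) * (1.0 - r_yz ** 2))

# Hypothetical correlations (x = reading ability, y = college success,
# z = scholastic aptitude).
r_reading_success = 0.50
r_reading_aptitude = 0.60
r_success_aptitude = 0.60

adjusted = partial_correlation(
    r_reading_success, r_reading_aptitude, r_success_aptitude
)
print(round(adjusted, 3))
```

With these assumed values the correlation of .50 between reading ability
and college success falls to about .22 once scholastic aptitude is held
constant, showing how much of the original relationship could be credited
to the uncontrolled factor.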

The Statistical Reliability of the Results. The reliability of performance
of a subject refers to the extent to which his performance can be
attributed to the functioning of systematic variables. We know that the
measurements of psychological characteristics are subject to many
variable factors and we can expect errors of measurement even under the
best-designed testing situation possible. Even after carefully
standardizing all of the experimental procedures, we cannot be sure that the
performance obtained from a subject is representative of his ability, that is,
that the performance we obtained would closely approximate what he
would do on the average if we tested him a large number of times.
Actually, what we would get in repeating the measurements would be a
variation in his performance (see Fig. 1, Chap. 5). We would like to obtain
what is called his representative or “true” performance for the set of
conditions under which we require him to perform. A given measurement
is reliable to the extent that it approximates the value of this true
performance.


The meaning of unreliability deals with the degree to which
variables that do not operate systematically are determining the results.
Interest is centered on the factors that result in unsystematic or variable
errors, that is, variations in the subject’s performance that occur in both
positive and negative directions to about the same amount and in about
the same frequencies. We increase reliability by reducing these factors
giving rise to variable errors. In previous discussions we have referred to
these factors as chance factors. Regardless of how well we think we have
prevented these chance factors from operating, it is still necessary in
analyzing our results to evaluate their contribution to the subjects’
performance.
The Evaluation of Chance Variation. From the arguments on the meaning
of reliability it is seen that one of the important objectives of
organization and analysis is the evaluation of chance factors. We assume that
these chance factors affect the performance of subjects in a random 
fashion, so that the values of the experimental variable will be increased 
and decreased equally. Furthermore, we assume that small effects upon 
the experimental variable will occur more frequently than large effects, 
and that very large effects will occur very infrequently. 
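
These assumptions can be checked against a simulated set of chance
errors. The sketch below assumes, for illustration only, that the errors
follow the normal curve with a mean of zero and a standard deviation of
one unit; it then counts how often errors of different sizes and
directions occur.

```python
import random

rng = random.Random(3)
SD = 1.0  # assumed spread of the chance errors
errors = [rng.gauss(0.0, SD) for _ in range(10000)]

n = len(errors)
positive = sum(1 for e in errors if e > 0) / n              # increases
small = sum(1 for e in errors if abs(e) <= SD) / n          # within 1 SD
large = sum(1 for e in errors if abs(e) > 2 * SD) / n       # beyond 2 SD
very_large = sum(1 for e in errors if abs(e) > 3 * SD) / n  # beyond 3 SD

print(f"positive: {positive:.3f}  small: {small:.3f}  "
      f"large: {large:.3f}  very large: {very_large:.4f}")
```

In such a simulation about half of the errors are increases and half are
decreases, roughly two-thirds are small, and only a small fraction are
very large, which matches the three assumptions stated above.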

As noted in Chap. 5, for most psychological variables chance variation 
will follow a definite function called the law of error. The law of error 
is very important in that it enables us to evaluate the contribution of
chance error in our results. When we perform an experiment only once,
we do not have the opportunity to learn exactly how chance actually
operates on the performance of our subjects. We therefore estimate that
for our results the law of error found characteristic of most other
psychological factors would be applicable to our variables. It then becomes a
problem of estimating the amount of the chance error and taking it into 
account when we are working with the variation attributable to the 
changes introduced in the experimental conditions. If we know the total 
amount of change reflected in the experimental results and can estimate 
the amount of change that could have resulted from the operation of 
chance factors, we have a basis for deciding whether or not significant 
variation can be attributed to the experimental variables. Said another
way: we obtain information about the significance of our experimental 
variation by estimating the probable amount of variation due to chance 
factors. If the error variation is so small as to be insignificant we have 
confidence that the experimental variation is significant. This is one of our 
most important procedures for evaluating our experimental results. 

In a study by the authors on the effect of vitamin deficiency on the 
rat's general activity level, the mean activity scores of the deficient rats 
on each day were higher than the mean scores of the normal rats. 
Furthermore, the two activity curves based on the means became more 
and more divergent as the experiment progressed. A more thorough ex- 
amination of the data, however, showed that the difference between the 
two curves was not significant. Chance factors operated to produce wide 
fluctuations in the performance of both deficient and normal rats,
vitiating the comparison based on the diet variable.
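
A situation of this kind can be sketched by comparing the difference
between two means with the standard error of that difference. The
activity scores below are invented for illustration and are not the data
of the study described; the function name is likewise our own.

```python
import math
import statistics

def difference_ratio(group_a, group_b):
    """Difference between two means, its standard error, and their ratio."""
    diff = statistics.mean(group_a) - statistics.mean(group_b)
    se = math.sqrt(
        statistics.variance(group_a) / len(group_a)
        + statistics.variance(group_b) / len(group_b)
    )
    return diff, se, diff / se

deficient = [120, 40, 200, 75, 160]  # hypothetical activity scores
normal = [90, 30, 170, 60, 140]      # hypothetical activity scores

diff, se, ratio = difference_ratio(deficient, normal)
print(f"difference = {diff}, standard error = {se:.1f}, ratio = {ratio:.2f}")
```

Here the deficient group has the higher mean, but the scores within each
group fluctuate so widely that the difference is smaller than its own
standard error. A difference of that size could easily arise from chance
factors alone.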

Facilitating Generalization and Prediction. The Objective of Generali- 
zation and Prediction. The objective of both generalization and predic- 
tion is to go beyond the immediate facts and meanings established in the 
testing situation. As we have seen, a hypothesis is never actually “proved.”
Rather, it is confirmed in those aspects that have been set to empirical test
by some form of experimental or other kind of study. Not only should a
hypothesis be consonant with the facts collected in the testing of its
theorems, but it should be compatible with the facts of many other
situations and conditions. No single study of a hypothesis provides us with
knowledge of all of its many ramifications and implications.
Generalization functions to apply the hypothesis as broadly as possible. It carries
its implications into new areas where the variables and factors operate
somewhat differently than they did in the situations already studied.
Generalization actually points the way to additional studies that need
to be done.

See Chap. 11, p. 255, Fig. 10.

Generalization also forms the basis for making predictions about the 
consequences that would be expected to occur when the hypothesis is 
applied to a situation containing components not yet examined. This pre- 
dictive function is very important in science. It is the procedure used in 
justifying the generalizations with which science is concerned. The 
success of any prediction will be conditioned upon the degree to which 
the predicted situation agrees with the predictor situation. Organization
and analysis aid us in determining the characteristics that are essential
to the hypothesis. This information then enables us to discover new situ- 
ations in which we can make effective predictions. 


Establishing Meanings to Be Used in Generalization and Prediction. 
Organization and analysis are the means for establishing the nature of the
... the data and also of determining ... which these generalizations and
... basic to generalization and prediction ...

SOME COMMON CHARACTERISTICS NEEDING DESCRIPTION


Whether we are studying humans or animals, groups or individuals, 
complexes of behavior determiners or relatively simple reflex actions, 
there are certain characteristics of response that past research has demon- 
strated are important and that should be described quantitatively when 
possible. Actually these common characteristics are very fundamental
meanings which are extremely useful in understanding many different 
kinds of data. A thorough familiarity with them as general concepts is 
expected of any scientific psychologist. The three most important of these 
characteristics can be stated in general form as follows: the level of 
achievement, the variability of performance, and the relationships among 
performances that are alike in one or more characteristics. 

The Level of Achievement. Meaning of the Concept. Interpreted 
broadly, level of achievement refers to “goodness” of performance. It is 
assigned a more exact meaning according to the nature of the particular 
response being evaluated and the purpose of the evaluation. Regardless 
of the way a performance is described, to assign the meaning of level of
achievement is to assign some value to the performance.

Some examples will show the wide variety of situations in which the
meaning of level of achievement can be utilized. One of the most familiar
situations is the classroom, where the teacher represents students’
performances in terms of grades. Evaluation of level of achievement helps
to answer such questions as the following: How is Jack doing in school?
Does he have the ability to graduate from high school? Does he show a
particular aptitude for certain school subjects and an ineptitude for
others?

Evaluations of achievement are constantly made in business and
industry. Management needs to know the relative effectiveness of workers
on the job. It asks such questions as: Are Smith’s sales up to par? Is
Jones’ production still suffering from his recent illness? Are the lathe
operators turning out enough units of work?

In the areas of personal and social adjustment, information on
achievement is of importance in evaluating the individual’s success in getting
along with his fellows. We desire to know if Bill gets along well with his
older brothers, if he has a high regard for his parents, if he approves
of the efforts of his family in helping him to meet his college expenses.
Evaluations of adjustment are needed in such social situations as are
provided by the school, the fraternity, the club, and the gang. Answers
are needed to such questions as: Does Bill have many male friends? Is
he popular with the girls? Is he invited to parties? Has he
been bid to a fraternity? What interest does he show in school activities?
It should be apparent that many different special meanings can be
given to the term level of achievement. These meanings vary with the
nature of the response situation and the purpose for which the meaning
is needed. It should also be apparent that different evaluation procedures
will be required to measure level of achievement as it is given these
different special meanings.

Ways of Representing Goodness of Performance. Level of achievement 
may be represented symbolically in three different ways, namely, by 
word descriptions, by assigning one of several large quantitative cate- 
gories, or by assigning a numerical score. 

Description by numerical scores is considered the most accurate kind 
if the assignment of the number meanings can be justified. As we noted 
in Chap. 6, counting is a number process readily applied to behavior 
when the behavior is manifest in the form of unit responses. For ex- 
ample, the number of items assembled by a bench worker in a specified 
period of time is a numerical description of the amount of his perform- 
ance. The number of pages completed by a typist is a similar score. 
Quality or accuracy of performance is also represented through count- 
ing. The number of items spoiled by the bench worker, or the number 
ai ARONS made in the copy by the typist, are quantitative scores repre- 
senting maccuracy of response. Numerical description of level of achieve- 
ment is then possible when the performance is divisible into small com- 


aral its. N ; 
= ble units. Numerical scores offer a very adequate symbolic represen- 
on. 


given these dif- 


adily divisible into comparable small 
of description are used. It is difficult 
strength of political conviction by 
ts of conviction. We then resort to 
ariable. These categories are located on 
arious values of a single magnitude. For 
red by means of a rating 


of categories. A 
e rated by acquaintance 
quently the categories are 


as in the case of letter grades assigned students’ 


performance in the classroom. 
The use of numerical sco itati 

quantitative categories has not 
Levels of achievement in more 
4 accurately represented solely 
Of cour imi d riza- 
dion of cock ek — se, a limited characteriza 
ed, and different scores can be 
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used for describing different aspects of the behavior. The meanings of 
several separate scores become less accurately applicable, however, as 
the nature of the behavior being represented becomes more general. 
Separate scores then do not represent or describe the unitary nature of 
the behavior. They must be supplemented by word descriptions. 

The need for word descriptions is readily observed in the area of per- 
sonal adjustment. Certainly there is no question but that some individuals 
are better adjusted than others. The better adjusted persons can be con- 
sidered to have reached a higher level of achievement in behavior. We 
can devise several large quantitative categories by which the different 
levels of achievement can be represented. To depend upon a single cate- 
gory to describe the level of adjustment of a given individual, however, 
is to ignore further refinements in evaluation that can be realized through 
word descriptions. Whether these word descriptions are called quantita- 
tive or qualitative in nature is not of great consequence as long as they 
contribute to a better understanding of the level of achievement.
Suppose we are concerned with the social adjustment of a male college
student. One area in which the effectiveness of his adjustment would be
reflected is his association with members of the opposite sex. We could
obtain a rating of the individual as to whether during the past year he
had had a large number, a few, or hardly any girl friends. A rating that
he had a large number of girl friends fails to give us an accurate
description. He may be genuinely popular, and so finds it easy to [...]
girls. Or he may find it difficult to remain very long on good terms with
a girl and thus has to change his girl friends often. In either case he
would receive the same rating. In this type of situation a description in
categories or in numerical scores is not sufficient because the goodness
of behavior cannot readily be represented in a single-type value.

Evaluating Goodness of Performance in Terms of a Standard. Whether
words, large quantitative categories, or numerical scores are used, the
evaluation requires comparison of the performance of the individual
or the group with some standard. The behavior being evaluated is
referred to some norm, and meanings about the behavior arise from
comparing the behavior with the norm.

If, in a word description, we say that Bill is getting along very well
socially, there are social norms against which his behavior is being
evaluated. Getting along well means agreement with the social norms.
These norms will not be the same in different societies nor within
different strata of the same society. Furthermore, there will be local,
neighborhood, or [...] of behavior that will be used as standards. There
may be differences between the local ways and more general society
ways. As a consequence, different evaluations of the same behavior may
be in disagreement when gauged by norms of different kinds. For
example, a delinquent act of a teen-ager may be appraised differently by
a judge in court than by the fellow members of his gang.

Evaluation of performance by means of large categories is also norma- 
tive in nature. The goodness of an A grade in school arises from the 
fact that this descriptive category is applied to a certain select few from 
among a large group, the remainder of whom are assigned other letter 
grades denoting lesser achievement. 
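
The normative character of a score can be illustrated with a short computation. The sketch below is not part of the original text: the function name `percentile_rank`, the convention of counting ties as half, and both sets of norm scores are assumptions made for illustration. It shows that one and the same score takes on a different meaning when referred to two different norm groups.

```python
def percentile_rank(score, norm_scores):
    """Per cent of the norm group scoring below `score` (ties count half).

    Counting ties as half is one common convention, assumed here.
    """
    below = sum(1 for s in norm_scores if s < score)
    ties = sum(1 for s in norm_scores if s == score)
    return 100.0 * (below + 0.5 * ties) / len(norm_scores)


# Hypothetical norm groups with fictitious scores, for illustration only.
fourth_grade = [12, 15, 18, 20, 21, 22, 24, 25, 27, 30]
fifth_grade = [18, 21, 24, 25, 27, 28, 30, 31, 33, 36]

score = 25
rank_in_fourth = percentile_rank(score, fourth_grade)
rank_in_fifth = percentile_rank(score, fifth_grade)

print(f"Score {score}: percentile rank {rank_in_fourth} among fourth graders")
print(f"Score {score}: percentile rank {rank_in_fifth} among fifth graders")
```

With these fictitious norms the score stands well above the middle of the one group and below the middle of the other, which is the sense in which the meaning of a score depends on the group to which it is referred.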

The meanings of a numerical score are gained from a knowledge of the
frequency of that score as compared with the frequency of other scores
and from a knowledge of the characteristics of the group of individuals
in question. Knowledge of the group’s characteristics is needed although
seldom brought to our attention. Its importance will be seen if we
consider the evaluation of a child who has been held back a year in school.
Suppose that according to chronological age a child belongs in the fifth
grade but actually is only in the fourth grade. Superior performance
of the child measured in terms of the performance of fourth-grade
children would be considered less than superior when measured in terms of
the behavior of children of his own age.

Meanings of an individual’s performance can be gained by comparing
his score with other scores of a group to which he belongs, provided he
possesses characteristics common to the others in the group, such as age,
educational achievement, physical growth, conditions of health, etc. The
common characteristics, of course, should be relevant to the performance
being evaluated. Suppose that Bill’s score is 25. If [...] class is zero,
this knowled[...]

We see, then, that meanings of an individual’s level of achievement are
[...] namely, relations between the performance [...] norm of
performance. The norm [...]

[...] achievement in any given activity measured on different occasions in the
same person. When one performance is obtained from each of many 
subjects, we must deal with interindividual variations, usually referred 
to simply as individual differences. When many performances on one 
task are obtained from one subject, we must deal with intra-individual 
variations. When several performances on a given task are obtained from 
several subjects, we must deal with both kinds of variation. 
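
The two kinds of variation can be kept apart in a small computation. The following is a minimal sketch under stated assumptions: the scores are hypothetical, and the population standard deviation is chosen here as the measure of spread, a choice the text does not itself make at this point.

```python
from statistics import mean, pstdev

# Hypothetical scores: several performances on one task from several subjects.
scores = {
    "subject_1": [10, 12, 11, 13],
    "subject_2": [20, 19, 21, 20],
    "subject_3": [15, 25, 10, 30],
}

# Intra-individual variation: the spread of each subject's own performances.
intra = {name: pstdev(trials) for name, trials in scores.items()}

# Interindividual variation: the spread of the subjects' average performances.
subject_means = {name: mean(trials) for name, trials in scores.items()}
inter = pstdev(list(subject_means.values()))

for name in scores:
    print(f"{name}: mean {subject_means[name]:.1f}, intra SD {intra[name]:.2f}")
print(f"inter SD of subject means: {inter:.2f}")
```

In this fictitious case subject_3 has the same average as subject_2 but fluctuates far more from trial to trial, so a prediction about his next performance would be the less accurate of the two.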

Individual Differences. The study of variation in level of achievement 
among individuals forms a very important chapter in psychology. Recog- 
nition of this concept is of prime importance in the schoolroom, at the 
factory bench, in the business office, in the training of children within 
the home, in fact, in every situation where the behavior of more than 
one person is involved. Individuals differ in response under conditions 
which physically appear to be alike. Knowing the extent of these differ- 
ences enables us to know to what extent we should vary the training, 
the discipline, the treatments, the recognitions, the awards, etc., by 
which each individual is encouraged and enabled to attain the highest
level of adjustment.

Variations within the Individual. To understand a person and to make
successful predictions about his behavior, it is necessary to study intra-
individual differences. When an individual performs an act many times, 
instead of being the same on all trials, his scores vary in amount and 
form a distribution around his average performance. The accuracy of our 
prediction about any future performance of the individual rests upon 
the extent of these variations in his performance. If he fluctuates wildly 
our prediction will be less accurate than if he performs consistently within 
narrow limits. It is then important that we be able to measure or
estimate intra-individual variations.

Suppose that in the manufacture of mechanical toys a machine requires
the close attention of a machine tender. The machine is set to operate
within a narrow range of speeds that will result in the greatest number of
acceptable toys in an 8-hour period when operated by the average
worker. It is then important that the variability of the machine tender
comes within the tolerance limits of the machine. If his performance
varies to the extent that he gets behind the machine, then the toys
produced will be defective, the machine might get clogged, and the waste
in materials and labor could be considerable. The tender might even
try to operate the machine faster than it is set for. Again, the chances
would be high for jamming the machine and for producing defective toys.

Measures of the consistency of performance under distraction are
often required. Some individuals seem unable to readily adjust to
distracting stimuli, and their behavior fluctuates wildly when such stimuli
are encountered. Consistent behavior under fire is desired in the operator
of a phone exchange during an emergency, such as an earthquake, in
the aircraft gunner during an air raid, in the surgeon during an
emergency operation, or in the chairman of a political body during a heated
debate.

Another form of variation within the individual concerns the con- 
sistency of response in different areas of behavior. No individual is 
equally variable in all of his activities. A measure of his response con- 
sistency in important kinds of activities greatly improves our under- 
standing of the individual and his potentialities. For example, it is im- 
portant to know the variability of an individual in respect to various 
traits of personality. When we ask the question: Can Mr. Jones be 
trusted to do the job? we are inquiring about several kinds of behavior. 
We might have in mind consistency of response in respect to his ability 
to organize the different components of the job, his facility for working 
with people, his available drive and motivation, his tolerance for routine 
work, or other such behaviors connected with the job to be done. 

The Analytical Use of Measures of Variation. Brief mention should be 
made of the fact that the characteristic of variation can be used in evalu- 
ating the contribution of each of several variables determining a complex 
act. Simply stated, the variation of a given complex form of behavior 
stems from the variation in the many factors underlying it. These factors 
will not vary equally among themselves. Some knowledge about the
importance of the different factors can be gained from the relative
proportion of the total variation contributed by each of the factors. A
statistical procedure, called the analysis of variance, has been devised by
which this apportioning of the variation among the determinants can be
achieved. It is beyond the scope of the present text to explain this
procedure, but the reader should be aware of the fact that through the
evaluation of the characteristic of variation it is often possible to discover
the relative importance of the several determiners of a given complex
event.
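
The idea of apportioning variation can be shown in its simplest form, with a single factor. The sketch below is an illustration only and is not the full procedure the text leaves unexplained: the scores and the factor name (mode of presentation) are hypothetical, and the function `partition_variation` is introduced here for the purpose. It splits the total sum of squared deviations into a part attributable to differences between the groups and a part remaining within the groups.

```python
from statistics import mean


def partition_variation(groups):
    """Split total variation into between-group and within-group parts.

    `groups` maps a level of one factor to the scores obtained under it.
    Returns (ss_total, ss_between, ss_within) as sums of squared deviations.
    """
    all_scores = [x for g in groups.values() for x in g]
    grand_mean = mean(all_scores)
    ss_total = sum((x - grand_mean) ** 2 for x in all_scores)
    ss_between = sum(
        len(g) * (mean(g) - grand_mean) ** 2 for g in groups.values()
    )
    ss_within = sum((x - mean(g)) ** 2 for g in groups.values() for x in g)
    return ss_total, ss_between, ss_within


# Hypothetical learning scores under two modes of presentation.
groups = {"visual": [8, 9, 10, 9], "auditory": [6, 7, 5, 6]}
ss_total, ss_between, ss_within = partition_variation(groups)
proportion_between = ss_between / ss_total

print(f"total {ss_total}, between {ss_between}, within {ss_within}")
print(f"proportion of variation attributable to the factor: {proportion_between:.2f}")
```

The two parts add up to the total, so the proportion contributed by the factor can be read off directly. With several factors the same logic is extended, which is the province of the analysis of variance proper.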


Relational Meanings among Variables. Relational Meanings Arising
from Comparisons of Variables. We have noted earlier that our
knowledge of a variable is evolved by comparing it with other variables.
Facts are relational in nature. Facts about a variable include the ways
in which it relates to other variables. Meanings, as descriptions of the
relationships among variables, reveal characteristics of the variables
themselves.

One way of relating variables is to note the presence or absence of
common characteristics. Variables are described as similar in nature
if they possess common characteristics. They differ in nature if each
possesses characteristics that are unique. We noted earlier that this type
of relational meaning is utilized in interpreting level of achievement by
comparing a given performance with some standard or norm.

A more precise statement of a relationship is made possible when
quantitative categories or numerical scores are used to describe the
variables. Differences in magnitude of one variable may be related to
differences in magnitude of a second variable. For example, for
eighth-grade students it might be found that the higher the score on a reading
test, the more likely the success of the individual in mastering high
school subjects. Mastery of high school subjects could be indicated by
averaging the grades at the end of some stated time, such as at the end
of the freshman year. The average grade category of each student could
then be compared with his score on the reading test. It might be found,
in general, that the higher the reading score, the higher the average
high school grade. Such a relationship could be quantified by means of
correlation.
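
One way of quantifying such a relationship is the product-moment correlation coefficient. The following is a minimal sketch: the text names correlation only in general, so the choice of the Pearson coefficient is an assumption, and the reading scores and grade averages are fictitious values for six hypothetical students.

```python
from math import sqrt


def pearson_r(xs, ys):
    """Product-moment correlation between two equally long lists of scores."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)


# Fictitious eighth-grade reading scores and freshman-year grade averages.
reading_scores = [35, 42, 50, 58, 66, 71]
grade_averages = [1.8, 2.1, 2.6, 2.4, 3.3, 3.6]

r = pearson_r(reading_scores, grade_averages)
print(f"correlation between reading score and grade average: {r:.2f}")
```

A coefficient near +1 expresses the statement that the higher the reading score, the higher the average grade; a coefficient near zero would express the absence of any such relationship.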

Analyzing the Composition of a Complex Variable. A frequently
occurring research objective in psychology is the determination of the
nature of the composition of a given complex act. The determinants of 
complex behavior often function simultaneously in very involved inter- 
relationships. Through the process of analyzing the act into its parts and 
seeking the nature of the relationships among the parts, significant im- 
petus is given to our understanding of the act as a whole. 

To illustrate the compositional analysis of a complex variable, let us 
consider the important characteristic called ability, which has been 
under scientific study for many years. Numerous attacks have been di- 
rected toward learning about its nature. All of these attacks start from 
two fundamental bases that can be considered as necessary assumptions. 
First, regardless of our definition of ability, we agree that it is expressed 
or manifest in the objective behavior of the individual. Secondly, the 
individual differences in performance so readily observed objectively 
among individuals are considered to reflect these differences in ability. 
The type of ability that has been studied most often is that reflected in 
performance on mental and motor tests. Performance on the different 
kinds of tests is considered to be the expression of one or more abilities. 
The differences in test performance of different individuals are
considered to reflect differences in these abilities. As a result of many
research studies, the general conclusion is being formed that the variations
in performance obtained on a large variety of tests can be referred to a
small number of abilities. Research is now being directed to determine
just what these primary or necessary abilities are and how many must
be postulated to account for all the performances observed in a
situation involving a large number of tests.




Another compositional analysis might be concerned with the deter- 
mination of the various components of a group’s attitude toward political 
freedom. Certainly, the psychological factors conditioning any group’s 
stand on a given social issue are large in number and contribute in vary- 
ing ways and amounts to the expression of opinion by its members. One 
of our objectives is to discover the picture of the components. We could 
study them as simultaneously functioning contributors to the current 
expression of the group. Our control and prediction of the group’s per- 
formances would be improved through such a compositional analysis. 

The industrial psychologist also is faced with a problem requiring 
compositional analysis when he endeavors to define successful
performance on a given job. He must learn how ability, training, job
experience, incentives, home conditions, vacations, procedures for
advancement and promotion, union membership, overtime pay, management’s
supervisory policies, and other similar factors function and interact with
one another to condition the job performance of the workers.

Relationships Ordered in Time. Determinants of behavior can be
studied as they are related in time, one preceding, another following. This
[...] been mentioned as one of the important orders [...] nearly every
project he undertakes. It differs [...] type of relationship described
above only in the [...] on the temporal character of the relationship.
One type of temporal relationship [...] with increasing chronological age.

USEFUL PROCEDURES FOR ORGANIZING, ANALYZING,
AND INTERPRETING DATA 


For convenience of exposition, the various methods available for dis- 
covering and assigning meanings to the original performance records of 
a study can be subsumed under the following three headings: tabular 
procedures, graphical procedures, and numerical procedures. Although 
certain tabular procedures often are utilized first, there is no particular 
order in which these three types of methods should be applied. In fact, 
they are to be considered closely related rather than distinctly separate 
procedures. For example, we may use a graph for purposes of analysis
and get from it ideas for applying numerical procedures. Or, after
performing certain numerical analyses, we may make a graph of the results
and get a better conception of what the numerical computations have
accomplished.

A very large number of tabular, graphical, and numerical procedures
are now available to the research worker. It will be possible here to
present only the broad outlines of the purposes served by a few of these
procedures. In addition to the procedures described there are many
others with which every scientist should be familiar.

Some Tabular Procedures. Uses of Tabular Procedures. The two pri- 
mary purposes served by tables are those of organizing the data and of 
facilitating numerical computations. As we learned earlier, the arrange- 
ment of scores or other measures recorded during the conduct of an 
experiment is usually based upon convenience. For example, if we were 
running subjects in a study on learning we would very likely arrange our 
data chronologically, that is, according to the days of the month on 
which the subjects were tested. If the purpose of the experiment were 
to determine if auditory presentation of the material were less effective 
than visual presentation, the data must then be rearranged in order to 
provide a basis for comparing the results obtained from the two modes
of presentation. Tables enable us to organize the data
for extracting information about the theorem under investigation.
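
The rearrangement described here, from the order of convenience to the order required by the question, can be sketched briefly. The records below are hypothetical (testing day, mode of presentation, learning score), and averaging within each mode is merely one assumed way of summarizing the regrouped data.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical records in chronological order, as they might be collected.
records = [
    ("day 1", "visual", 14),
    ("day 1", "auditory", 11),
    ("day 2", "auditory", 9),
    ("day 2", "visual", 16),
    ("day 3", "visual", 15),
    ("day 3", "auditory", 10),
]

# Rearrange by the experimental variable, mode of presentation.
by_mode = defaultdict(list)
for day, mode, score in records:
    by_mode[mode].append(score)

mode_means = {mode: mean(values) for mode, values in by_mode.items()}

for mode, values in by_mode.items():
    print(f"{mode:<9} scores {values}  mean {mode_means[mode]}")
```

Once the scores stand in two columns, one for each mode, the comparison the experiment was designed to make can be read directly from the table.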

Tabular arrangement of data often facilitates the computation of statis- 
tical constants. In later sections, mention will be made of certain constants 
used in measuring average performance, variability of performance, and 
correlation between variables. When records are being analyzed from a 
large number of subjects, there is considerable labor involved in comput- 
ing any statistical constant. By arranging the data in tabular form, certain 
short-cut methods of computation can be used, and much time and labor
are then saved.

Tables of Classification. Tables are basic to the ordering of facts
through classification. Facts must be organized in reference to the
experimental variables. In the example of a survey on tolerance toward
war, several possible ways of classifying the data were mentioned. In 
Table 2, a multiple classification is made of fictitious tolerance scores 


Table 2. Comparison of Fictitious Average Tolerance Scores
by Age, Nationality, and Sex

              Nordic nations       Alpine nations    Mediterranean nations
Age group    Male Female Both     Male Female Both     Male Female Both
               A     B     C        D     E     F        G     H     I
  [...]       10     8     9       11    11    11       14    15    14
  [...]       12    15    14       14    14    14       16    17    17
  [...]       15    16    15       17    16    17       17    18    17
  [...]       16    18    17       19    19    19       18    18    18
  [...]       17    17    17       21    20    20       19    20    19
  [...]       20    19    19       22    22    22       19    19    19


with the variables of age, nationality, and sex as the axes of organization. 
Each entry would be the average of the scores of several respondents. 
Letters have been inserted in some of the columns for reference purposes. 

An examination of Table 2 will show the ease with which various
comparisons can be made. Comparison of nationalities by age groups can
be done by studying the columns labeled C, F, and I. The tolerance
scores of males can be compared between nationalities by age groups
by using columns A, D, and G. Similarly, females can be compared
between nationalities through columns B, E, and H. Comparison by age
groups of the tolerance scores of male and female within each nationality
can be accomplished through columns A and B, D and E, and G and H
for the three nationality groups.

The Frequency Distribution. One of the most frequently used kinds
of tables is the frequency distribution illustrated in Table 3. The data
tabled consist of 200 scores ranging in value from 0 to 130. Obviously,
great difficulty would be encountered if we tried to extract meaning
from a list of scores 200 numbers in length. We would do better to order
the scores from low to high and to combine them into larger units called
class intervals. The interval used in Table 3 has the value of 10. In
preparing the table we would use some form of tally mark, as indicated,
which enables us in one step to translate the scores from any
heterogeneous arrangement in which we find them to the ordered
arrangement of the frequency distribution. The frequency of scores within
each interval is determined from a count of the appropriate tally marks.
The table in its final form consists of only two columns, the class intervals
and the frequencies.

Table 3. Frequency Distribution of Scores of 200 College
Students on an Intelligence Test

Class       Simple
interval    frequency   Tally
  0-9           2       ||
 10-19          3       |||
 20-29          7       ||||| ||
 30-39         11       ||||| ||||| |
 40-49         22       ||||| ||||| ||||| ||||| ||
 50-59         34       ||||| ||||| ||||| ||||| ||||| ||||| ||||
 60-69         43       ||||| ||||| ||||| ||||| ||||| ||||| ||||| ||||| |||
 70-79         33       ||||| ||||| ||||| ||||| ||||| ||||| |||
 80-89         21       ||||| ||||| ||||| ||||| |
 90-99         12       ||||| ||||| ||
100-109         7       ||||| ||
110-119         4       ||||
120-129         1       |
Total         200

Many meanings are obtainable from examining the frequency
distribution. We can learn the approximate value of the lowest and highest
scores. The greatest concentration of scores is revealed by the interval
having the highest frequency. The changes in the frequencies from one
end of the distribution to the other enable us to form a notion of the
shape of the distribution of scores.

If the scores of two or more groups are to be compared, the frequencies
of scores for the several groups can be arranged in parallel columns in
the table. The meanings noted above can be extracted for each group
and comparisons across groups can readily be made.
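
The tallying procedure can be sketched in a few lines. The example below is illustrative only: the twelve scores are hypothetical, the function name `frequency_distribution` is introduced here, and the scores are assumed to be non-negative whole numbers so that each falls into an interval such as 0-9 or 10-19.

```python
from collections import Counter


def frequency_distribution(scores, width=10):
    """Group whole-number scores into class intervals of the given width.

    Returns a list of (lower limit, upper limit, frequency), low to high,
    including empty intervals that lie between the extremes.
    """
    counts = Counter((s // width) * width for s in scores)
    lows = range(min(counts), max(counts) + width, width)
    return [(low, low + width - 1, counts.get(low, 0)) for low in lows]


# Hypothetical scores in the heterogeneous order in which they were recorded.
scores = [44, 3, 58, 25, 61, 17, 45, 31, 28, 52, 47, 36]
table = frequency_distribution(scores)

for low, high, freq in table:
    print(f"{low:>3}-{high:<3} {'|' * freq:<5} {freq}")

# The interval having the highest frequency shows the greatest concentration.
modal_interval = max(table, key=lambda row: row[2])
```

The printed rows correspond to the three columns of the working table: class interval, tally, and simple frequency.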

The Per Cent Frequency Distribution. This type of distribution is used
when groups are to be compared that contain different numbers of cases.
The name per cent frequency distribution is given to this type of
distribution because the frequencies are translated into fractions of 100 and
thus expressed as percentages.
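
The translation into percentages is a simple computation. The sketch below uses the sophomore and junior frequencies listed in Table 4 for score values 0 through 10; the function name `per_cent_frequencies` and the rounding to the nearest whole per cent are assumptions made for this illustration.

```python
# Frequencies for score values 0 through 10, as listed in Table 4.
sophomores = [5, 7, 10, 15, 24, 33, 24, 16, 10, 7, 4]
juniors = [10, 15, 19, 31, 48, 65, 49, 31, 21, 13, 8]


def per_cent_frequencies(freqs):
    """Express each frequency as a whole-number percentage of the total."""
    total = sum(freqs)
    return [round(100 * f / total) for f in freqs]


sophomore_pct = per_cent_frequencies(sophomores)
junior_pct = per_cent_frequencies(juniors)

for value, (s, j) in enumerate(zip(sophomore_pct, junior_pct)):
    print(f"score {value:>2}: sophomores {s:>2}%  juniors {j:>2}%")
```

At a score value of 2 the raw frequencies are 10 and 19, yet both come to 6 per cent of their respective groups, so the apparent advantage of the juniors disappears once group size is taken into account.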
Suppose we have the test scores on a scholastic-ability test of a
sophomore group of 155 cases and a junior group of 310 cases. The
distributions for these two groups are provided in Table 4.

Table 4. Comparison of College Sophomores and Juniors on Fictitious
Scholastic-ability Test Using Original Frequencies and
Per Cent Frequencies

               Frequencies            Per cent frequencies
Score
values     Sophomores   Juniors      Sophomores   Juniors
   0            5          10             3           3
   1            7          15             5           5
   2           10          19             6           6
   3           15          31            10          10
   4           24          48            15          15
   5           33          65            21          21
   6           24          49            15          16
   7           16          31            10          10
   8           10          21             6           7
   9            7          13             5           4
  10            4           8             3           3
Total         155         310

Comparison of the groups on any given score value is difficult because
of the difference in total number. For example, for a score value 2 there
are 10 sophomores and 19 juniors. If we considered these frequencies
alone we might conclude that a greater proportion of juniors obtained
this score than did sophomores. This difficulty is readily overcome by
expressing the frequencies as percentages. This is done in columns 4 and 5
of the table. In the sopho- [...]

[...] variables on a two-dimensional graph using rectangular coordinates.
One variable is placed on the abscissa (horizontal axis) and the other 
on the ordinate (vertical axis). The score values or score intervals of the 
two variables are recorded on the respective axes. Paired values are then 
plotted, a dot being placed at the point of coincidence of the values 
of a given pair. For example, in Fig. 2 the relationship is plotted between 
the height and weight of 20-year-old boys. For simplicity, only five pairs
of measures are used. Within the figure are the paired values for five boys.
Consider the height and weight of Richard. We locate his height of 6
feet 3 inches on the abscissa and his weight of 169 pounds on the
ordinate. We then extend imaginary lines on to the graph from these values
and where the lines intersect we place a dot. Thus there is one dot for
each pair of values, that is, one for each boy. A line is drawn through the
plot of points in such a way as to leave the least amount of deviation of
the points away from the line on either side.

[Fig. 2: scatter plot; abscissa, height in feet and inches; ordinate,
weight in pounds. Paired values shown within the figure: David 5'1", 130;
Jack 5'7", 149; Bill 6'0", 158; Richard 6'3", 169; Harold 6'5", 172.]

Fig. 2. Relationship between weight and height of 20-year-old boys
(measures are fictitious).

Examination of the graph reveals a pronounced positive relationship
between height and weight. As height increases, weight increases. The
plot of points is represented better by a straight line than by a curve.
This then is a rectilinear relationship. Although such relational meanings
can also be detected in the corresponding pairs of numbers, they are
much more difficult to extricate from the numbers than from the graph.

Graphs are very popular as a means for communicating relational
meanings because of the inherent ease of visual perception. Far less
effort is required to examine and analyze relationships portrayed
graphically than when they are represented by columns of numbers. In a graph,
the essential nature of the meaning is abstracted and symbolized by lines.
The reader is then not required to examine the many original numbers
from which the relation stems. If a meaning can be represented in
two-dimensional space it can readily be communicated to other investigators
by means of simple rectangular coordinate graphs.
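
The line drawn through the plot of points in Fig. 2 can also be found by computation. The sketch below rests on several assumptions: it uses the method of least squares, which is one standard way of making the deviations from the line as small as possible and which the text does not name; heights are converted to inches; and the five pairs are read from a damaged copy of the figure, so the name Bill and Harold's weight of 172 pounds in particular are uncertain readings.

```python
# Height in inches and weight in pounds for the five boys of Fig. 2.
# Values are read from a damaged figure; Harold's 172 is an assumed reading.
boys = {
    "David": (61, 130),
    "Jack": (67, 149),
    "Bill": (72, 158),
    "Richard": (75, 169),
    "Harold": (77, 172),
}


def fit_line(points):
    """Least-squares straight line y = slope * x + intercept."""
    n = len(points)
    mean_x = sum(x for x, _ in points) / n
    mean_y = sum(y for _, y in points) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in points)
    sxx = sum((x - mean_x) ** 2 for x, _ in points)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


slope, intercept = fit_line(list(boys.values()))
residuals = {
    name: weight - (slope * height + intercept)
    for name, (height, weight) in boys.items()
}

print(f"weight = {slope:.2f} * height + ({intercept:.1f})")
for name, deviation in residuals.items():
    print(f"{name:<8} deviates from the line by {deviation:+.1f} pounds")
```

A positive slope corresponds to the positive relationship seen in the graph, and the small deviations on either side of the line correspond to the statement that a straight line represents the plot of points well.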

Graphing Functional Relationships. A functional relationship between 
two variables is readily depicted in a two-dimensional graph. Usually, 
rectangular coordinates are used as in Fig. 2. In a functional relationship 
one variable of the pair is considered independent because in some 
respect it has priority over the other variable. This priority may be
determined by the time of occurrence of the variables, the variable coming
first being considered the independent one and the other the dependent
variable. Priority may be determined by the degree of control we exert
over the variables. In research, the experimental variable that is
systematically manipulated is the independent variable. In graphing a
functional relationship it is customary to place the independent variable
on the abscissa and the dependent variable on the ordinate.

A very familiar functional relationship in psychology is the learning 
function. Here we have the amount of progress associated with the 
amount of practice. Obviously, the more an individual practices, the 
better will be his performance. The nature of the rate of improvement
with continued practice (whether the rate of improvement is the same
throughout the several practice periods or is faster at one time than at
another) should be decided only by an appeal to empirical facts. The
learning curve of Fig. 3 was made up to represent improvement that
might be manifested by a group of 10 subjects doing simple multiplication
problems. The number of correct problems is a function of the number
of practice sessions of 1-minute duration.

Several meanings are revealed from [...] that of reaching a limit beyond
which [...] achieve. [...] the physiological limit, one argument states
that the bodily mechanisms involved are working at their maximum.

Another meaning to be obtained from most learning curves comes from
the irregularity of the achievements associated with increasing practice.
It will be noted that the curve does not pass through all of the points.
Although each point is the average of 10 performances, the points form
a jagged or irregular line. The curve is smoothed by drawing the line
through the plot of points rather than through every point. It is argued
that through smoothing we obtain the form of improvement that would
most likely occur under ideal conditions.
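
Smoothing of this kind is ordinarily done by eye, but a numerical counterpart can be sketched. The example below swaps in a three-point moving average, which is an assumed technique and not the one used to draw Fig. 3; the ten averages of problems solved are hypothetical.

```python
def moving_average(values, window=3):
    """Replace each point by the mean of itself and its neighbors.

    At the two ends, where a full window is not available, only the
    neighbors that exist are averaged.
    """
    half = window // 2
    smoothed = []
    for i in range(len(values)):
        segment = values[max(0, i - half): i + half + 1]
        smoothed.append(sum(segment) / len(segment))
    return smoothed


def jaggedness(values):
    """Sum of absolute changes in slope from one segment to the next."""
    return sum(
        abs(values[i + 1] - 2 * values[i] + values[i - 1])
        for i in range(1, len(values) - 1)
    )


# Hypothetical group averages of problems solved in ten practice periods.
problems_solved = [22, 30, 28, 37, 35, 44, 41, 49, 47, 52]
smoothed = moving_average(problems_solved)

print("original:", problems_solved)
print("smoothed:", [round(v, 1) for v in smoothed])
print("jaggedness before and after:",
      jaggedness(problems_solved), round(jaggedness(smoothed), 1))
```

The smoothed values rise steadily where the original averages zigzag, which is the numerical analogue of drawing the line through the plot of points instead of through every point.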


[Fig. 3: learning curve; abscissa, practice periods; ordinate, problems
solved.]

Fig. 3. Learning curve for working simple multiplication problems.

The Frequency Polygon. One of the simplest and most significant types 
of graphs in psychological research is the frequency polygon. This is a 
graph of the relationship between frequency of occurrence and score 
value, the same relationship as is represented in the frequency distribu- 
tion table. In the frequency polygon the scores are e first a Kon 
thought to condition the frequency of —. Ta = t we 
i ies are considered a function of the 
Placed or scissa. The frequencies are ci 
Ae ordinate. The word polygon 


size x a re placed on the £ 
of ts ca ane A Fig. 4 is a polygon; the many sides 


Means many sides. The solid line it ; 
are le iy drawing the trend line through every point. When the 
line is smoothed it is then referred to as a curve. The broken line n Hig 
4 is the frequency curve, and geometrically speaking it is considere 


to have an infinite number of sides. i 
[Figure: vertical axis, frequency; horizontal axis, score on test.]
Fig. 4. Example of a frequency polygon showing the effects of smoothing
(broken line).

An inspection of either line in the graph will give most of the
meanings obtained from the frequency-distribution table and will give
these meanings with less search and with greater clarity. In addition,
the general contour of the frequencies and the symmetricality of the
frequencies on either side of the mid-point are much easier to detect
in the graph than in the tabular distribution. The relative variation
of the frequencies for different score units is an especially important
characteristic. For example, if greater frequencies occur at either end
of the range of scores than in the midrange, it usually is indicative
of the operation of important factors, and it is a sign that further
analyses are in order. When there are two regions of high frequencies,
the curve is called bimodal.
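
The counts that a frequency polygon plots can be tallied directly from the scores. The following is a minimal sketch under stated assumptions: the scores are hypothetical, the class interval of 5 score units is arbitrary, and the helper names are introduced here only for illustration.

```python
from collections import Counter


def frequency_table(scores, width=5):
    """Count integer scores per class interval; keys are interval lower bounds."""
    counts = Counter((s // width) * width for s in scores)
    lo, hi = min(counts), max(counts)
    # Include empty intervals so the polygon has a point for every interval.
    return {start: counts.get(start, 0) for start in range(lo, hi + width, width)}


def local_peaks(table):
    """Return the intervals whose frequency exceeds that of both neighbors."""
    starts = sorted(table)
    freqs = [table[s] for s in starts]
    padded = [0] + freqs + [0]
    return [
        starts[i]
        for i in range(len(freqs))
        if padded[i + 1] > padded[i] and padded[i + 1] > padded[i + 2]
    ]


# Hypothetical test scores with two regions of high frequency.
scores = [21, 22, 23, 23, 24, 26, 31, 36, 37, 38, 38, 39, 41]
table = frequency_table(scores)
print(table)
print(local_peaks(table))
```

Two peaks in the output correspond to what the text calls a bimodal curve. With real data, small chance bumps would also register as peaks, so this check is only a first look.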

The Normal Frequency Curve and the Normal Probability Curve. The
normal frequency curve has several special characteristics. It is like
that of Fig. 4 in being a symmetrical bell-shaped curve with the
frequencies decreasing from the middle to each extreme. It is not just
any bell-shaped curve, however, but one in which the frequencies
decrease according to a particular mathematical law. The curve of
Fig. 4 actually was drawn to approximate closely a normal frequency
curve.

The theoretical curve obtained from the mathematical formula is called
the normal probability curve. The particular formula of this curve was
developed by the mathematician to represent the expected occurrence of
events having certain mathematical probabilities. A good empirical
example would be that of predicting the fall of 100 pennies tossed
10,000 times. The events would be the different combinations of heads
in the 100 pennies, namely 100 heads, 99 heads, . . . , 1 head, and 0
heads. The formula enables us to predict how frequently each of these
combinations of heads would occur in 10,000 tosses. If we actually
performed the experiment we would obtain a frequency distribution that
would closely approach the form of Fig. 4.
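
The penny example can be worked out exactly. The sketch below assumes fair, independent pennies and uses the binomial law, which the normal probability curve closely approximates for 100 pennies; the function name is hypothetical.

```python
from math import comb


def expected_frequencies(n_coins=100, n_tosses=10_000):
    """Expected number of tosses showing k heads, for k = 0 .. n_coins.

    Assumes each penny is fair and falls independently of the others.
    """
    total = 2 ** n_coins
    return [n_tosses * comb(n_coins, k) / total for k in range(n_coins + 1)]


freqs = expected_frequencies()
for k in (40, 45, 50, 55, 60):
    print(k, round(freqs[k], 1))
```

The expected frequencies are highest at 50 heads and fall off symmetrically toward both extremes, which is the bell-shaped form the text describes.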
A most significant finding in psychology is the fact that the
distribution of occurrence of values for a very large number of
psychological characteristics closely approximates the normal
probability curve. This is often referred to as the “normal law.”
Thousands of times the psychologist has had the occasion to study the
frequency of occurrence of the various values of his variables. In a
very large proportion of these studies the variables have been found to
be distributed according to the normal law. We then have the following
two facts: (1) the mathematician has developed a formula which
accurately describes a particular type of distribution function, and
(2) the psychologist has empirically demonstrated that a large
proportion of his variables follow this distribution function. On the
basis of these two findings the psychologist then uses the mathematical
characteristics of the normal probability function to analyze and
describe his empirically derived normal frequency distributions.



The decision as to whether or not a given frequency distribution is
normal need not be left to subjective judgment. A statistical test is
available for determining in a quantitative way if the empirical data
distribute themselves according to the law of normal probability. If
the frequency curve approaches sufficiently closely the mathematical
curve, then we are justified in utilizing the characteristics of this
curve in the analysis of the data.

Bar Charts. Although the graphs previously described are often used
for displaying and communicating meanings, there are some graphs
specifically developed for this purpose. The bar chart and the column
diagram are popularly used for this purpose. Bars or columns are used
on one axis to represent amounts in the dependent variable, while the
independent variable is arranged along the opposite axis. The dependent
variable is placed on the vertical axis in the column diagram and on
the horizontal axis in the bar chart. Comparisons among a wide variety
of variables are possible with these graphs.

Suppose we wish to depict the relationship between performance on an
intelligence test and college graduation. Figure 5 illustrates this
type of relationship. The performance scores on the test are placed on
the vertical axis and the proportions of individuals graduating from
school are placed on the horizontal axis. The lengths of the bars are
made to correspond to the per cents of graduating students falling in
the several score intervals. We can see from a glance at the chart
that, in spite of some reversals in the increments from one score
interval to the next higher one, there is a definite relationship
between size of score and per cent of graduates: the higher the score,
the greater the proportion of students succeeding. It will be noted
that the relationship is not rectilinear but curvilinear, which can be
demonstrated by passing a smooth line through the ends of the bars.
From this curve we can note that what is being measured in the first
few lower steps in the test continuum is more significant in terms of
accounting for failure to graduate than what is measured . . .

[Figure: horizontal axis, per cent graduating from college (0 to 100).]
Fig. 5. Per cent of students graduating from college as predicted by an
intelligence test.

. . . type of information depicted in . . . achievement, variation in
response, and relationships between variables. The emphasis will be
upon the meanings and purposes of certain numerical constants and not
upon methods for computing them.

The Mean as a Measure of Level of Achievement. In previous sections,
we reviewed the use of numerical scores and large quantitative
categories as means of describing the level of achievement. The present
discussion will concern the mean as a measure of achievement. It can be
used to represent an individual's performances or the performances of a
group of individuals. The mean is the familiar statistical constant we
compute by adding the amounts of several given items and dividing this
total by the number of items. Suppose a student obtains the following
scores in six examinations: 20, 21, 24, 22, 20, and 19. We decide that
the mean will provide us with a reliable estimate of his average
achievement. Adding the scores we obtain the total of 126. Dividing
this sum by 6 results in a mean of 21. We have reduced six performance
scores to a number that can be used to represent all of them.
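
The arithmetic of the example can be retraced in a few lines. This is only a sketch of the computation just described; the function name is chosen here for illustration.

```python
def mean(scores):
    """Add the scores and divide the total by the number of scores."""
    if not scores:
        raise ValueError("at least one score is required")
    return sum(scores) / len(scores)


exam_scores = [20, 21, 24, 22, 20, 19]
total = sum(exam_scores)
print(total, mean(exam_scores))  # prints: 126 21.0
```

Every score enters the total, so changing any one of the six scores changes the mean.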

One characteristic we desire the mean to have is that of
representativeness. Its function is to stand for and take the place of
all of the items. It can be said that the mean represents each of the
items of a group because the value of every item enters into its
computation. In the above example every score contributed according to
its value, e.g., the score 24 contributed 24, which is 5 more points
than was contributed by the score 19. Logically speaking, if we have no
reason for thinking that the score 24 is unduly large because of the
operation of chance factors or other factors not concerned with the
knowledge of the subject matter being tested, we shall not want to
reject it or add it in at some lower value, but shall want it to
contribute according to its own amount. We have here an exemplification
of one of the fundamental rules of statistics, namely, that the
investigator is not justified in rejecting any values of his variable
unless there are sound empirical reasons, not personally biased
reasons, for excluding them.

Another characteristic desirable in any statistical constant is
stability. By this is meant that its value will not be greatly
sensitive to chance or sampling fluctuations. The mean is a stable
measure. For example, if we tested a given group three times in some
ability that was known to remain constant and the three testings were
conducted under approximately the same conditions, then the three means
obtained from the three administrations would have nearly the same
value.


The mean has several other characteristics that recommend its wide
adoption. It has the mathematical property of being algebraically
manipulable. We can add, subtract, divide, and multiply means as we can
other numbers. This is one of its most important properties but need
not be developed further in our discussions. The mean is also easily
computed and is readily understood, two additional characteristics that
make it a very serviceable constant to use.

The Standard Deviation as a Measure of Variation. As noted in Chap.
6, the standard deviation is a statistical constant that can be used to
measure variation either in individuals or in groups of individuals.
When we examine a distribution of scores depicting repeated
performances of one individual or the single performance of each of
many individuals, it is apparent that to measure the variability of the
scores we must have some central point from which the scores deviate.
This is to say that a . . . point set up as a reference point . . .
deviation can be measured. In the . . . measures. The standard
deviation is represented by the Greek letter sigma (σ).

[Figure: normal curve with the base line marked at the mean and at
+1, +2, and +3 on the standard deviation scale.]
Fig. 6. Illustration of standard deviation units in relation to the
normal distribution.

. . . number of problems worked, then SD is expressed as a given number
of problems worked. The actual value of the standard deviation is
obtained from the data by applying a formula. A careful inspection of
the curve will reveal that the inflection points (those points in the
curve where the curvature of the line reverses) fall at -1σ and +1σ
from the mean. The distance between the points -1σ and +1σ from the
mean includes 68 per cent of the scores. When laid off in both
directions from the mean a value of 3σ approximates the ends of the
distribution. This is to say that approximately 100 per cent of the
scores are included in the distance between -3σ and +3σ.

[Figure: curves for group A and group B; horizontal axis, score on test
(40 to 100).]
Fig. 7. Distribution of scores of two groups with similar means but
with marked differences in variability.
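
The proportions stated for distances of 1σ and 3σ from the mean can be checked against the normal probability function. A minimal sketch, assuming an exactly normal distribution; the function name is hypothetical.

```python
from math import erf, sqrt


def proportion_within(k):
    """Proportion of a normal distribution lying within k sigma of the mean."""
    return erf(k / sqrt(2))


for k in (1, 2, 3):
    print(k, round(proportion_within(k), 4))
```

The value for 1σ is a little over 68 per cent, and the value for 3σ falls short of 100 per cent by less than three-tenths of 1 per cent, which is why 3σ is said only to approximate the ends of the distribution.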

A further characteristic of the standard deviation is that it is a
constant unit of measurement. One sigma distance is the same along the
base line of the normal curve regardless of the particular point in the
range at which it is applied. That is, 1 sigma distance is the same
number of score units at the low end, in the middle, and at the high
end of the distribution of scores, because for a given group of scores
the standard deviation is constant in value.

Another characteristic of the standard deviation is its
representativeness. In computing sigma the value of every score is
used.

The importance of the standard deviation as a means of characterizing
groups of individuals can be pointed out by considering the problem of
comparing two groups in terms of their scores on an examination. Such a
comparison is presented in Fig. 7. It will be noted that the two groups
do not differ significantly in mean score, group A obtaining a mean of
70 and group B a mean of 71. If we had only the means as a basis for
comparing the two groups we would have to conclude that they were
almost identical. From the graph we can see that the groups are very
different except for their means. Group A is much more variable than
group B. The standard deviation of group A is 10, and that of group B
is 5. We can say that group A is twice as variable as group B in its
performance on the examination. This difference is clearly portrayed in
the curves of Fig. 7.
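
The comparison of the two groups can be illustrated with a small computation. The scores below are hypothetical, chosen only so that the means and standard deviations match the values quoted in the text. The divide-by-N form of the standard deviation is assumed here, since the formula itself is not given in this section.

```python
from statistics import mean, pstdev

# Hypothetical scores: group A has mean 70 and SD 10; group B has mean 71 and SD 5.
group_a = [55, 65, 70, 75, 85]
group_b = [63.5, 68.5, 71, 73.5, 78.5]


def describe(scores):
    """Return (mean, standard deviation), using the divide-by-N form of SD."""
    return mean(scores), pstdev(scores)


mean_a, sd_a = describe(group_a)
mean_b, sd_b = describe(group_b)
print("A:", mean_a, sd_a)
print("B:", mean_b, sd_b)
print("ratio of SDs:", sd_a / sd_b)
```

The means differ by one point, while the ratio of the standard deviations shows group A to be twice as variable as group B.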

The Measurement of Relationship by Correlational Methods. Co-relation
or correlation refers to the fact that variables “go together.” Another
way of expressing it is that there is a concomitant variation between
the variables, or more simply “varying togetherness.” A coefficient of
correlation is a single number that expresses the extent or amount of
the relation between two variables. It measures the extent to which
variation in one variable is associated with variation in another
variable.

Correlation in its broader aspects includes many kinds of
relationships. Many specific adaptations of the concept have been made
and corresponding mathematical constants have been derived for
quantifying the particular types of relationships that have been
studied. We shall be concerned primarily with the correlation
coefficient between two continuous variables that are rectilinearly
related. A continuous variable is one in which the units or scores are
divisible into any size required, that is, the variable being measured
changes in magnitude by infinitesimal amounts, as in the case of
intelligence. This is the type most widely found in the study of
psychological variables. Correlation between two continuous variables
will illustrate characteristics that are applicable in varying degrees
to the correlation between discrete variables, dichotomized variables,
and variables related curvilinearly.

An effective measure of relationship should reflect the amount or
degree of relationship between the variables. Some variables are
closely related with each other, others are only moderately related,
and still others are remotely related or unrelated. We need a procedure
for expressing in quantitative terms this variation in amount of
relationship which extends from no relation on the one hand to perfect
relation on the other. For pairs of continuous variables we have such a
procedure in the product-moment coefficient of correlation, also called
the Pearsonian coefficient after the mathematician who derived the
formula. It is symbolized by the letter “r.”

. . . usually consist of two sets of measures on the same group of
individuals. A point is plotted on the diagram for each individual by
placing a dot at the intersection of imaginary lines extending
vertically and horizontally from his scores in the two variables.

Different amounts of relationship are represented in the three plots.
For the sake of simplicity only a few points have been plotted in each
diagram. In Fig. 8A we have what is termed a perfect relationship. The
points all fall on a straight line and the change in one variable is
directly proportional to the change in the other. When the values of
two variables so related are inserted in the formula for computing the
coefficient of correlation, the resulting coefficient will have the
value of 1.00. In Fig. 8B the relation is less than perfect but there
is discernible a definite trend of relation. Coefficients of
correlation of relationships of the order diagrammed in this figure
have values in the neighborhood of .60. Figure 8C demonstrates a very
low correlation. It will be noted that the divergence of the points
from the trend line is much greater than in the other diagrams, but
there is still a slight indication of a relationship between the
variables. Coefficients of correlation of relationships of this order
have values around .10.

[Figure: three scatter diagrams, A, B, and C.]
Fig. 8. Illustration of three degrees of correlation.
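
The way the coefficient responds to scatter about a straight line can be seen in a short computation. The sketch below uses the standard product-moment formula, r = Σ(x - Mx)(y - My) / sqrt(Σ(x - Mx)² · Σ(y - My)²). The formula is not printed in this section, so its use here is an assumption, and the paired scores are hypothetical.

```python
from math import sqrt


def pearson_r(x, y):
    """Product-moment coefficient of correlation for paired scores."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("x and y must be paired and contain at least two scores")
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sum_xy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sum_xx = sum((a - mean_x) ** 2 for a in x)
    sum_yy = sum((b - mean_y) ** 2 for b in y)
    if sum_xx == 0 or sum_yy == 0:
        raise ValueError("r is undefined when either variable does not vary")
    return sum_xy / sqrt(sum_xx * sum_yy)


x = [1, 2, 3, 4, 5]
on_a_line = [2, 4, 6, 8, 10]   # every point falls on a straight line
scattered = [2, 1, 4, 3, 5]    # same general trend, with some divergence
print(round(pearson_r(x, on_a_line), 2))
print(round(pearson_r(x, scattered), 2))
```

The first set gives a coefficient of 1.00, as in a perfect relationship. The second gives a smaller value because the points diverge from the trend line.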


It can be seen from these diagrams that the amount of relation is
represented by the extent to which the points deviate from the
best-fitting straight line. This line is known as the line of relation.
It is further noted that the meaning of amount of relation varies
continuously as the relationship varies from the point of no relation
to the point of perfect relation. The product-moment coefficient of
correlation expresses this continuous change, and its value varies
continuously from .00 to 1.00.

Negative values of the correlation coefficient indicate that the trend
of relationship is opposite to that shown in the diagrams of Fig. 8.
When high scores in one variable are associated with low scores in the
other, the trend will be from upper left to lower right, and the
coefficient will be a negative number. In the study of psychological
variables we usually are interested in knowing if effective behavior in
one variable is associated with effective behavior in another; that is,
that “goodness” of performance in factor A goes with “goodness” of
performance in factor B. If in all of our variables we represent
goodness of performance by high values and poorness of performance by
low values, then genuine negative relationships between psychological
variables are seldom found. For instance, relationships among human
abilities are nearly always found to be positive in nature.

The amount of relationship indicated by a coefficient may be in error
relative to the “true” amount of relationship existing between the
variables correlated. In interpreting the meaning of a coefficient of
correlation we must remember that the coefficient that we compute
measures the amount of relationship existing between the scores that
are inserted into the formula. We are primarily interested, of course,
in the relationship between the psychological variables underlying the
scores. Therefore, the scores must be reliable . . . obtained through
the coefficient . . . variables. At times we are ignorant . . .
determiners operating to influence . . . empirical evidence.

It should be clear from the foregoing discussions that the concept of
correlation gives us a tremendous advantage in understanding
relationships, whether they are functional or nonfunctional . . .
relationships, the coefficient adds information about the determinant
relationship existing . . . relationships, the coefficient . . . other,
but the high correlation . . . common determiners of the two variables
. . . mental growth processes of the body.

. . . computations underlying the concepts presented. Topics covered in
the discussions include the frequency distribution, tabular methods,
measures of relative position, averages, measures of variability, the
normal distribution, and the concepts of regression and correlation.


CHAPTER 11


Generalizing from Scientific Data 


Generalizing is the last step in the conduct of a scientific study, the
step wherein we evaluate and apply the experimental findings obtained
from our investigation. Our interest is in learning what significance
these findings have for the hypothesis we are studying and for other
purposes and problems which differ from those of our study. This
process of discovering and stating the implications of the results for
our hypothesis leads us to conceptual elements and characteristics that
are logical extensions of the empirical facts of our investigation.
Forecasting to other situations by extending the empirical findings
contributes valuable solutions to problems of both practical and
theoretical significance.

As the value of scientific research is a direct function of the range
of accurate generalization that is achieved, we shall profit greatly
from a consideration of the types of errors that must be avoided if
sound generalizations are to be formulated.


THE PURPOSES SERVED BY GENERALIZATION 


Knowledge grows because the legacy of understanding left from the past
is insufficient to solve all of our present-day problems. Growth in
knowledge comes about by the addition of small increments of fact to
what is already known. We effect this growth by studying hypotheses and
generalizing about new facts collected to confirm or disconfirm these
hypotheses.

. . . many purposes, so . . . the solution of other problems than
their own. If a study has no meanings that can be extended beyond its
immediate results in order to solve related problems, it must be judged
as of little worth.

Any research study should be evaluated in terms of the number of 
significant questions it raises as well as the number of problems that it 
solves. Seldom are the data discovered in an experiment so comprehen- 
sive in nature that they describe all aspects of the problem and provide 
answers to all questions relevant to the problem. In fact, scientists con- 
sider projects to have great value when they are pregnant with new 
problems and new questions. Questions raised by a study actually extend 
the findings of the study and lead to their application in new situations. 
An important function that the scientist can perform for his fellows is to 
point his generalizations toward unanswered questions and develop the 
generalizations with such clarity and detail that other scientists will be 
encouraged to embark on a search for the answers. 

An illustration will clarify the foregoing points. A situation in psy- 
chology in which generalization might be severely restricted is one in 
which the conclusions merely list the characteristics of the sample of 
subjects studied. For example, we could set up the problem of determining
the intelligence, reading ability, spelling ability, and social
adaptability of the students in Miss X’s eighth-grade class in school Y. 
With the data collected, our conclusions could simply be a description 
of these characteristics of this particular sample. Obviously, we have 
solved the problem but we have not extricated all of the meanings in- 
herent in the data. 

We could use the information for making various decisions about this
particular group. If the class were deficient in reading ability, we
could look to the intelligence test scores for a possible answer. If
the class contained a group of mischievous students who were doing well
in their lessons but who, nevertheless, were responsible for most of
the classroom disturbances, we could determine if they were
exceptionally bright and if they scored low on the social adaptability
test. Thus, relationships among these several attributes might provide
leads to the explanation of some of the behavior of the group or of
individual members of it. The findings would thus be extended as trial
solutions to other related problems.

Questions of some importance might arise if we carefully analyzed the
results. For example, questions could be raised concerning the nature
and extent of the relationship between intelligence and each of the
other attributes, the nature and extent of the relationship between the
social-adaptability scores and the frequency of classroom misconduct,
and similar relationships. These questions would issue in problems
requiring further research.

Similarly, we could use the information for generalizing about the
behavior of other groups of eighth-grade students, and probably for
generalizing about the behavior of students in grades other than the
eighth. Questions similar to those mentioned in the preceding
paragraphs could be asked about these other groups.

Generalizing as a Means of Advancing the Application of Knowledge. 
Generalizations setting forth problems yet to be solved can be directed 
toward applications in practice or applications in theory. We should not 
think of these two phases as unrelated but rather as mutually interrelated 
and interdependent. Growth in practical applications arises from previous 
accomplishments in theory. In turn, practical applications reflect upon 
theory to reveal additional needs for theoretical development. 

Advancing Practical Applications. Findings of the theoretical scientist
arouse the interest of the practical scientist and lead him to make
generalizations that eventually result in some technological
convenience. Obviously, if no one had ever thought beyond the findings
of the theoretical scientist, and pondered over what practical use
could be made of these findings, we would not now have the many
applications of science that are enriching all phases of physical and
psychological living. Each time a new technological convenience has
been provided, there previously has been an insightful generalization
from preexisting theory. If theory should stagnate, practical
applications would shortly cease.

Many theoretically bent scientists refuse to consider the practicality
of their research findings. This would appear to be an extremely
short-sighted policy. For instance . . . such . . . following up . . .
may even be led to valuable . . . nature of the unknown constituents.

Generalizing for . . . upon the further development of theory. Thus in
many instances the practical situation provides important problems for
the development of theory.

For the good of both future theory and future practice, present theory 
should be maintained in a vigorous state. 


THE PROCESS OF GENERALIZATION 


Through statistical and logical analyses and interpretations, we
discover the meanings inherent in the data we have collected. Through
generalization, we extend these meanings to situations differing in
some significant aspect from the situation in which we evolved them.
The accuracy of the generalization will depend upon our correctly
extracting the meanings from the data and upon our discovering in the
new situations an adequate number of the determinants underlying these
meanings.

The Probabilistic Nature of Generalization. By this is meant that there
is no certainty that a given generalization will hold true. On the
basis of the available evidence we decide whether or not we are willing
to make the generalization, realizing, of course, that we can be found
in error. The meanings available to us have been obtained from evidence
that is formally incomplete. That is, the meanings have not been
obtained under exactly the same set of conditions required by the
generalization we are making. For example, we may have based our
evidence on a sample of subjects and wish to generalize to the
population. Or we may have studied one or two kinds of expression of a
psychological process (e.g., learning) and wish to generalize about all
forms or manifestations of this process. Or we may have studied a part
of a certain complex work procedure (e.g., assembling a radio chassis)
and wish to generalize about the procedure of assembling the entire
radio. In each of these instances there are unknowns not covered by our
evidence. Our inferences must be the most likely ones, the ones best
substantiated in the data already collected. But they will always
contain some uncertainty; there will always be a guessed portion that
can never be completely removed.

By careful organization and analysis of the data we reduce the amount
of the uncertainty and form an estimate of the probability that our
generalization will be correct.

The Nature of the To-Be-Generalized-To Situation. The Problem. In
order for the meanings discovered in one situation to hold true for
another situation, it is necessary for the two situations to have in
common the determinants of these meanings. Suppose we have developed an
aptitude test which successfully predicts graduation from a liberal
arts college, and we generalize that it would be equally effective if
used in a college of engineering. We assume that the
to-be-generalized-to situation (graduation from engineering college)
will be determined in much the same way as the situation that we have
studied (graduation from liberal arts college). Without this assumption
there would be no logical basis for generalizing. We cannot generalize
in a universe “ruled by caprice.”

We must juxtapose the pattern of determinants of the experimental and
the to-be-generalized-to situations and make judgments about the
correspondence between the two in terms of the meanings to be
generalized. This is difficult because the new situation to which the
generalization is to be applied is not a known situation in the sense
of occurring at this time. It is a postulated future situation. It must
be patterned on the past, but it must have certain new characteristics
demanded by the generalization. Uncertainty is introduced in
endeavoring to describe a future situation containing the essential
characteristics required by the generalization and also having a high
likelihood of occurring. In our example the new and uncertain element
is whether the aptitudes needed for completing the work required by
engineering colleges are those measured in our test.



There is the further problem of clearly describing the point of
reference in the to-be-generalized-to situation, that is, the reference
point to which the generalized meanings are to be applied. Points of
reference to which generalizations may be directed are of many
varieties, but for purposes of description they can be subsumed under
three kinds, namely, persons, facts, and principles.

Generalizing about Persons. The purpose of many generalizations is to
predict the characteristics and behavior of particular persons or
groups of persons. Having studied the attributes of a group of
individuals and formulated generalizations that contain the
characteristics of the group as a whole, we may then proceed to apply
the generalizations to the subsequent behavior of the group or of any
individual within the group. Or again, we may apply the generalization
to similar individuals or to similar groups of individuals. In these
latter instances the generalization can be accurate only if the
determinants underlying the generalization as found in the population
studied also operate similarly in these other individuals or groups. In
evolving the to-be-generalized-to situation we must describe the
characteristics of the persons studied and the persons to whom the
generalization is being referred.

Generalizing about Facts. Generalization may serve the useful purpose
of deducing a fact that is not directly observable. In a given study,
after we have marshaled and organized all the facts we possess, we may
find that there is still a missing . . . there is a blank that must be
filled before we . . . generalizing about the nature of . . . This
function of generalization . . . detective stories. The reader often
has to generalize about missing factual aspects of the story before he
can reach a conclusion concerning the identity of the person who
commits the crime.

Generalizing about Principles. The common goal of all scientific psy- 
chology is to discover fundamental principles underlying behavior. One of 
the primary purposes of generalizations is to develop and expand explana- 
tions of behavior. When a generalization involves principles, it is often 
difficult to evolve an accurate and comprehensive description of the to-be- 
generalized-to situation. 

A generalization about explanatory principles may be made without 
regard to any specific population to which it might apply. The generaliza- 
tion is directed toward an understanding of the psychological processes 
involved in the behavior under study, and there is little concern given 
to the characteristics of the subjects used in obtaining the facts. For 
example, suppose we wish to study the rate of forgetting as a function of 
time since learning, and we use volunteer subjects for the experiment. We 
are selecting the subjects with no particular to-be-generalized-to popula- 
tion in mind. After we collect and analyze the facts, our generalizations 
are referred to principles and not to persons. That is to say, our
conclusions are about the rate of forgetting irrespective of the
particular persons who volunteered as subjects. Certainly one of the primary purposes
of scientific psychology is to discover laws of behavior without in any 
important sense referring these laws to particular kinds of persons. This 
kind of reference point is very frequently used by the investigator in theo- 
retical psychology. 

Evolving the Meanings to Be Generalized. The Problem. From a very
complex array of interrelated conditions we must evolve the pattern of
determinants that underlie the meanings to be generalized. In any study
of behavior, many determinant factors will be operating. Some of these
factors will be understood, others only partially understood, and still
others wholly unknown. Some of the factors will be relevant to our prob-
lem while others will be irrelevant. Some of the factors will be under
adequate control, others partially controlled, and still others uncontrolled.
Despite this complexity we must learn what the determinants are, whether
they have operated in a characteristic or representative manner, and
what the relative importance of each is. We must also make sure that our
description of the determinants is accurate and represents the objective
situation with a high degree of fidelity.

Abstracting the Meanings. As we learned in Chap. 10, the meanings
that underlie generalizations are extracted from the data by dint of many
logical and statistical analyses. These meanings tend to be abstract in
nature. They are derived from the facts obtained in our investigation
coupled with whatever other relevant knowledge we have. Primarily they
are abstracted and conceptualized from the meanings that are readily
observed in our data or can be readily traced to the data. Meanings that 
are immediately available from superficial inspection of the data are 
seldom of great significance except as starting points from which to 
generate more highly conceptualized meanings. 

Success in applying a meaning to a subsequent situation rests in part 
upon freeing the meaning from much of the particular immediate context 
of the experiment in which it was discovered. If we find that an essential 
determiner of the meaning cannot be freed from the particular context of 
the experiment, generalization is impossible. By this is meant that if the 
meaning to be generalized results from some determiner unique to the 
investigation in which it was discovered, that is, a determiner that can- 
not be reproduced in a different situation, we lack the basis for making 
a generalization. A few characterologists have achieved some success in 
diagnosing personality because of their personal understanding of people. 
A false generalization is made when they attribute this success to some 
system they have devised, e.g., phrenology. The system in the hands
of another does not bring success because the essential determiner, the 
knowledge of the characterologist himself, is not transferable with the 
system. To be successful, a meaning to be generalized should stem from
those empirical conditions that we suppose can be duplicated in subse-
quent situations.


[...] meanings. He is [...]as from which
the meanings can be safely abstracted. He is the one most familiar with
all of the conditions and findings of the investigation, and presumably
[...] the extent to which they can be [...] There may be still others that are
[...]ar cannot be reproduced unless
the original procedures are adhered to in a very rigorous manner.


Generalizing from Scientific Data 243 


It is the scientist's task to discover and describe the essential determi-
nants, those determinants that can be reasonably assumed to underlie
the meanings to be generalized. Furthermore, he should point out any 
conditions that prima facie appear likely to elicit the meaning and yet 
will not. Obviously, when there is a patent error possible in generalizing, 
the scientist must warn others of it. This he does when describing the 
essential determiners of the meanings. 

The Need for Unambiguous Description of the Meanings. A very im- 
portant task is to describe the origin and development of the meaning 
to be generalized in order that other investigators may learn just how 
much abstraction and conceptualization were needed to obtain it from 
the available facts. Any given meaning may remain closely associated 
with the empirical facts from which it was derived, or it may be found 
removed from these facts by several steps of logical abstraction. If an- 
other investigator is to adopt the meaning in a changed context, it is 
necessary that he learn in detail how the meaning was evolved. A thor- 
ough description of the facts upon which the meaning depends will go 
far toward enabling him to utilize it correctly under a changed context. 
An example of a failure to give an unambiguous description occurs when 
a scientist derives a meaning through certain mathematical formulations 
but fails to describe these formulations accurately when he presents the 
meaning.

The Mechanics of Generalizing. The Problem. To generalize a meaning 
accurately we must show that its essential determinants can be repro- 
duced in the to-be-generalized-to situation. This we may be able to do if 
we have an accurate knowledge of the characteristic meanings occurring
under the new set of conditions. Our problem is to match the determi- 
nants of the abstracted meanings with factors in the new situation. 

Not all of the variables of the experimental situation should be repro- 
duced in the to-be-generalized-to situation. Actually, the value of a 
generalization is a function of the extent to which differences are intro-
duced between the two situations. That is to say, the greater the differ-
ence between the two situations, the further the generalization reaches
into the unknown. It is also true that the greater the difference between
the two situations, the fewer the common determinants underlying the
generalization. It is, then, very necessary to determine the relative im-
portance of the common factors.

Similarity of Determinants as the Basis for Generalization. Similarity of
determinants is basic to the formulation of a generalization. We are re-
ferring to the similarities between the known situation and the to-be-
generalized-to situation. Our task is to study the similarities or likenesses
of the two situations and frame the generalization in terms of the kind,
number, and amount of the common determinants. Which determinants
must be the same and which can be different will be a function of the
nature of the generalization that is formulated. Similarity of determinants
merely refers to the fact that some of the determinants must be the same.

Other things being equal, the greater the number of common determi- 
nants, the more likely is the generalization to be found valid. Of course 
other things are not equal, and the number of common determinants 
needed for realizing a given generalization will then be a function of the 
relative importance of the common determinants underlying the meaning 
to be generalized. 

The amount of the contribution of any determinant must also be as- 
sessed. Some determinants will contribute more than others, and some 
determinants will contribute more under one set of conditions than under 
another set. When we compare the experimental and the to-be-general-
ized-to situations, merely learning the presence or absence of
determinants is not a sufficient goal.
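
The interplay between the number of common determinants and their relative importance can be sketched numerically. The following Python fragment is only an illustration under stated assumptions: the determinant names, the importance weights, and the function `weighted_overlap` are hypothetical and are not taken from any study discussed here.

```python
def weighted_overlap(known, target):
    """Return the share of the known situation's total determinant weight
    that is also present in the to-be-generalized-to situation.

    known  -- dict mapping each determinant of the experimental situation
              to a hypothetical importance weight
    target -- set of determinants believed to operate in the new situation
    """
    total = sum(known.values())
    if total <= 0:
        raise ValueError("at least one determinant must carry weight")
    shared = sum(weight for name, weight in known.items() if name in target)
    return shared / total


# Hypothetical determinants and weights, for illustration only.
experiment = {"practice": 6, "motivation": 3, "room lighting": 1}

two_minor_in_common = {"motivation", "room lighting"}
one_major_in_common = {"practice"}

print(weighted_overlap(experiment, two_minor_in_common))
print(weighted_overlap(experiment, one_major_in_common))
```

In this sketch the situation sharing a single heavily weighted determinant scores higher than the one sharing two lightly weighted determinants, which is the sense in which a mere count of common determinants is not enough.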

It should be obvious that the two situations can be alike in respect 
to characteristics that are not determinant in nature. There may be
similar aspects that are not part of the nexus of determiners basic to
our generalization. Errors will result if we base our generalization on
these nondeterminant similarities. An error of this type is described on 
p. 256. 
in the new situation the functioning of which appears to be related to
the generalization but in fact is not. 

The Process of Generalizing for Practical Purposes. The Problem. In 
this type of generalization we are usually concerned with a to-be-gen- 
eralized-to situation that can be devised by making changes in present 
situations. We can therefore obtain an immediate check on our generali- 
zation. The new situation is of a practical nature, and the generalization 
is concerned with meanings that can be readily introduced into this 
situation. 

Suppose that a practical problem arises for which an immediate solu- 
tion is needed. Our task is to devise an experiment in the hope of finding 
a solution. Because of the practical nature of the problem, we can obtain 
suggestions from the empirical situations in which the problem is found. 
We can find aspects of our problem in these situations, and from them 
we can get hints about the characteristics of the solution. In fact, some 
of the important features of the solution are stipulated by the demands 
of these situations. The generalizations from any study we complete can 
be referred back to these empirical situations to determine their validity. 

An Example. Suppose we are approached by the manager of a factory 
with a request to develop a battery of tests for selecting workers for oper- 
ating bench lathes. Our task is to devise psychological tests by which we 
can predict the abilities required for the operation of these lathes. At the 
end of our study the major generalization would be that applicants hav- 
ing the particular combinations of abilities we had evolved would be 
successful bench-lathe operators.

The generalization is directed to a future practical situation much like 
an existing situation. The to-be-generalized-to situation involves the oper- 
ation of bench lathes as it is practiced in this particular factory. This 
situation is knowable for our purposes because it can be considered to be
the present situation of operating bench lathes in the factory. Before de-
vising any tests we are able to conduct analyses that reveal the determi-
nants of lathe operation under present factory conditions. By gaining
knowledge of these determinants we thereby gain knowledge of the de- 
terminants of the new situation to which we wish to generalize. 

Our generalization can be rigorously tested because we are general- 
izing back to a factual or a to-be-factual situation. Predictions can be 
made of the success in operating bench lathes of the applicants we test, 
and these predictions can be checked in terms of the subsequent success 
of the employees on the job. 
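
The check described above, comparing predictions with subsequent success on the job, is commonly summarized by a correlation between test scores and a later criterion measure. The sketch below assumes a Pearson product-moment correlation as that summary; the scores, the ratings, and the function name `pearson_r` are hypothetical and serve only to show the arithmetic.

```python
from math import sqrt


def pearson_r(xs, ys):
    """Pearson product-moment correlation between two equal-length lists."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two lists of equal length with at least 2 items")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined when a variable is constant")
    return sxy / sqrt(sxx * syy)


# Hypothetical data: battery scores at hiring and supervisors' ratings
# of the same eight operators after a period on the job.
test_scores = [52, 61, 47, 70, 58, 66, 43, 75]
job_ratings = [3.1, 3.8, 2.9, 4.2, 3.0, 4.0, 2.5, 4.6]

validity = pearson_r(test_scores, job_ratings)
print(f"validity coefficient: {validity:.2f}")
```

A high coefficient in data of this kind would support the generalization that applicants with the tested abilities become successful operators; a low one would send us back to the analysis of determinants.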

We do not wish to give the impression that this task is an easy one 
because there is an actual objective situation in which many of the 
psychological determinants can be found. Actually, this situation com-
prises a complex global type of behavior which is difficult to analyze and
describe but which we must understand to a significant degree before we 
can proceed with the construction of the tests. Once the determinants 
are known and the tests constructed, however, the remaining procedures 
are straightforward and require no departure from well-established em- 
pirical methods. 

The Process of Generalizing for Theoretical Purposes. The Problem. 
This type of generalizing is the one used by the scientist interested in 
theoretical problems. He is primarily concerned with the advancement 
of knowledge in terms of explanatory concepts. There is usually no cur- 
rent practical situation conforming to the characteristics of the to-be- 
generalized-to situation. Inasmuch as his purpose is to generalize beyond
the experimental situation from which he obtained his facts, he sets up
hypothetical conditions that form the to-be-generalized-to situation. Of
course, this situation may [...] form for the purpose of [...] those
responsible for the experimental findings. [...] new situation must differ
from the experimental [...] amounts stated in the generalization.
Signifi[...] only when the determinants of the hypothetical [...]
from those of the experimental situation. The [...] two situations differ
are deliberately planned [...]nce theory. They are either explicitly
expressed [...]c statement of the generalization. The generalization is
then formed on the assumption that the findings will hold true
for a new situation that is not an exact duplication of the experiment in
which they were obtained. The to-be-generalized-to situation is created
to show how the propositions evolved in the experiment can be applied
under a stipulated set of changed conditions.

The differences between [...] and extent of the differences will be
determined by the nature of the hypothesis or theory that we are advancing.

An Example. This type of generalizing is illustrated by generalizations 
in which the findings from one research area of behavior are extended to 
explain behavior in another area. An example is the adaptation of con- 
ditioned-response learning theory to the phenomenon of serial learning 
of nonsense syllables. 

One generalization made from conditioning experiments using animal 
subjects is to the effect that if an indifferent or neutral stimulus is pre- 
sented together with an adequate stimulus to which the animal is making 
an appropriate response, the indifferent stimulus will become effective 
in eliciting much the same kind of response. Thus, in the conditioning 
experiments using the hunger drive, the bell stimulus becomes an ade- 
quate stimulus for arousing the salivation response which normally is 
associated with the meat stimulus. 

Let us apply this idea to the serial learning of a list of nonsense syl- 
lables.* Suppose the human subject is presented with a list of syllables 
to learn. On the first trial he is shown the list, one syllable at a time,
and is asked to pronounce each syllable. On subsequent trials he is shown 
the first syllable and asked to recall the second. Whether he recalls it or 
not, the second syllable is shown and he is asked to recall the third. This 
procedure is continued until the subject can anticipate every syllable,
that is, he can pronounce it before it is actually shown. 

The question now arises: What stimulus is operating to evoke the recall 
of a syllable by the subject before the syllable is actually presented to 
him? One explanation is based on the conditioned-response theory of 
learning. The first response of vocalizing a syllable after it is shown 
(trial 1) can be explained as dependent upon old well-established 
language habits. Vocalizing the syllable before it is presented (trials 
following trial 1) is dependent upon the auditory and kinesthetic sensory
experiences that arise from the preceding vocalized response. This is to
say that the vocalizing response itself elicits auditory sensory stimuli
(the heard syllable) and kinesthetic sensory stimuli (the muscular feel 
of the throat mechanisms in pronouncing the syllable). These sensory 
experiences occurring at the same time as the actual visual experience 
of the next syllable will eventually function as stimuli to elicit the next 
vocalized response.

Clarification of this extension of the conditioned-response theory can 
be obtained by reference to Fig. 9. The first visually presented syllable,
[...]₁, arouses the first vocalized response, VR₁. This VR₁ response gives rise
to auditory and kinesthetic sensations indicated as aks₁. These aks₁ ex-


* This example is adapted from W. M. Lepley, Serial Reactions Considered as Con-
ditioned Reactions, Psychological Monographs, vol. 46, no. 205, 1934.
method that may result in inaccurate generalization can be classified
in three broad categories roughly corresponding to three stages in the 
conduct of a scientific investigation. These stages are the deductive 
elaboration of the hypothesis that is to be studied, the execution of the 
experimental procedures of the empirical testing situation, and the ex- 
tension of the meanings to similar behavior situations. 

Examples of different types of errors from these three stages will be 
described in the following sections. A given inaccuracy in generalizing 
may be attributable to more than one type of error. The particular errors 
that are discussed are not then to be considered completely independent 
of one another. 


ERRORS ARISING IN THE LOGIC OF DEVELOPING 
THE HYPOTHESIS 


We shall be concerned here with errors of logic that occur during 
the developmental phases of our problem. We cannot expect to make valid 
generalizations if in devising and elaborating our hypothesis we fail to 
introduce the significant determinants of our problem and to establish 
these determinants in such logical relationships as will reveal meanings 
essential to a solution. 

Errors Due to Selecting Unimportant Implications. The Nature of the 
Error. For a hypothesis to be confirmed, we must discover evidence for 
the existence of the elements, relationships, or conditions that we postu- 
late in the underlying implications. In a previous chapter it was pointed 
out that hypotheses are based on certain assumptions or presuppositions. 
These assumptions form the starting point for the implications we derive 
from the hypothesis. 

We can expect that most hypotheses will have several implications and
that these implications will vary in respect to the significance of the re-
lationships that they bear to the hypothesis. Some implications will in-
volve only a few aspects of a hypothesis, while others will embrace
many aspects. We may just happen to choose an implication that readily
develops into a testable theorem but that adds evidence of no great sig-
nificance for the hypothesis. We are not then justified in generalizing
that the hypothesis has been confirmed.

It is not to be expected that we shall always be aware of the relative
importance of the different implications issuing from a hypothesis.
Scientists of repute have committed errors of generalization by not dis-
covering and developing the more significant implications of their hy-
potheses. It is to be emphasized, however, that the error is minimized
when we perform thorough theoretical and factual analyses of our
problem.
An Example. To illustrate this type of error, let us refer to a rather
general hypothesis that underlies much of our thinking about human 
behavior, the hypothesis: Personality is a function of bodily mechanisms. 
Let us accept this phrasing of the hypothesis as a first statement, but with- 
out the intention of defining it more precisely. We can derive many im- 
plications from this hypothesis. For example, personality is a function of: 


The complexity of organization of the bodily mechanisms 
The speed of reaction of the bodily mechanisms 

The physiological integrity of the bodily mechanisms 

The size of the bodily mechanisms 

The shape of the bodily mechanisms 

The chemistry of the bodily mechanisms 

The electropotential characteristics of the bodily mechanisms 


A common generalization underlies all of these implications, namely, that
behavior is the functioning of organic tissue, and t[...] behavior is a
function of vari[...] present reasons for and against [...] of present
knowledge, however, [...]tive for personality theory th[...]

[...] Inadequate Theorem. The Nature of the Error. [...] is meant that the
theorem selected [...]cally to the implication from which [...] here
assuming that we have selected [...] but that the theorem we decide to test
[...] not represent the conditions of the implication as adequately as would a
theorem involving one of the glands, such as the adrenals, thyroid, or 
pituitary. Decayed teeth, although affecting personality responses, are 
not as basically related to these responses as are pathological conditions 
in the endocrine glands. 

Errors Due to Using a Testing Situation That Does Not Logically 
Represent the Theorem. The Nature of the Error. Errors of generaliza- 
tion occur when the experimental or other situation that we devise does 
not logically develop from the theorem. We are presuming that a sig- 
nificant implication has been found and that a theorem has been de- 
veloped that adequately represents the basic elements of the implication. 
We must then plan such conditions for the empirical situation as will 
logically allow the elements and relationships of the theorem to function 
in this situation in the manner required by the theorem. If they do not so
function we cannot accept the test as a comprehensive check of the 
theorem. The test may contribute some useful information, but the ob- 
jective of confirming the hypothesis is not achieved. We are then not 
justified in generalizing that a hypothesis is confirmed when the condi- 
tions demanded by the theorem are not adequately reproduced in the 
test. 

An Example. Suppose we decide to devise an empirical testing situa- 
tion for the theorem: Traits of excitability are functionally related to ex- 
cessive secretion of the thyroid gland. “Traits of excitability” are now a 
more restricted and definitive expression of what in the hypothesis we
called “personality.” “Excessive secretion of the thyroid gland” is a more 
restricted and definitive expression of what in the hypothesis we called 
“bodily mechanisms.” We can err by not introducing into the testing 
situation representative measures of “traits of excitability” or representa-
tive measures of “excessive secretion of the thyroid glands.” For example,
if we used a pencil-and-paper test of introversion-extroversion we would
not adequately measure traits of excitability. The argument could be
advanced that we would still be measuring personality traits, but this
is beside the point. We have selected a definite theorem to test, which
involves traits of excitability, and a measure of introversion-extroversion 
does not adequately represent this personality phase of the theorem. 


ERRORS ASSOCIATED WITH THE REPRESENTATIVE 
FUNCTIONING OF THE VARIABLES 


Assuming that we have accurately and comprehensively elaborated 
our hypothesis and have made no mistakes in logic in devising an empiri- 
cal testing situation, we may still make errors in the actual execution of 
the test. These errors are not errors of logic. They are concerned with 
our failure to achieve in the empirical situation the expression of the vari-
ables that the logical development of our hypothesis has shown to be 
necessary. 

Errors Resulting from Distortion Introduced by Extraneous Variables. 
The Nature of the Error. In order to investigate a hypothesis, we must 
devise conditions that will force the experimental variable to function 
according to a prearranged plan which conforms to the demands of the 
theorem. The theorem contains the elements and relationships by which
the hypothesis is translated from conceptual events to empirical events.
Any distortion in the experimental variable means a failure to represent
correctly these elements and relationships.

The functioning of the experimental variables is frequently distorted
by the presence of unwanted interfering [...] These interfering variables
may either f[...] of the experimental variables, [...]hibitory, may vary in
terms of [...] effects of these interacting v[...] [...]actors. It is then
impossible to separate [...] from the contribution of the experimental
[...] be explained by (1) supposing that the ex[...] [...] supposing that the
experimental variable was inhibited from functioning by stronger opposing
variables, or (3) supposing that the experimental variable had an effect
opposite to that intended but that this effect was canceled out by the
positive effect of the other variables. [...] apparent that if interacting
variables [...] from functioning in the manner pre[...], any generalizations
based on the distorted expressions will be invalid.

[...] experimental variable. The theorem under test was that deficiency in a
certain vitamin B complex, by [...]ct upon mental ability, would increase
the time [...] learning a maze. Inasmuch as the diet variable was directly
related to food consumption, the use of
food as an incentive for learning the ordinary alley maze was considered
inappropriate. Instead, a water maze was used. The rat was lowered into
the water and was required to swim through the maze in order to get
out of the water. From the beginning of the maze running, the experi-
menter observed a great reluctance on the part of the diet-deficient rats
to enter the water and considerable trouble by many of these rats to swim 
the course of the maze. The control rats manifested these characteristics 
to only a minor degree. These observations led to the conclusion that the 
behavior of the deficient rats in the water maze was reflecting motiva- 
tional factors and motor strength and coordination factors in addition to 
possible mental-deterioration factors. It was impossible to isolate and 
control the motivational and motor-coordination factors, so the experi- 
ment was discontinued. 

Errors Due to Generalizing from Insufficient Data. The Nature of the 
Error. Let us assume that the development of the theorem is logically 
sound and that an adequate test has been devised. Errors due to insuffi- 
cient data may arise if we do not collect enough facts to give our theorem 
a fair empirical testing. We prematurely stop the test before collecting 
enough facts to make possible a statistically sound evaluation of the 
theorem. Any generalization will then be formulated prematurely. It 
may be correct or it may be incorrect; there will not be enough data to 
determine which it is. When the data are insufficient, generalization 
should be held in abeyance until further facts can be collected. 

An Example. Generalizations based on insufficient data are sometimes 
found in investigations in which traits of personality are associated with 
anatomical features of the body. Types of criminality have been linked 
with head shape, personality traits with body proportions, psychological 
temperament with body chemistry, etc., with little statistically sound sup-
porting evidence.

Suppose we study a sample of 36 criminals and discover a positive 
association between head shape and type of crime committed, as indi- 
cated in the left half of Table 5. It is seen that extortioners tend to have 


Table 5. Hypothetical Distribution of Head Shapes in Six Hypothesized
Subsamples of Persons

                       Type of crime              Randomly selected subsamples
Head shape     Extortion   Theft   Embezzlement        I       II      III
Round              6         3          3              5        2       4
Square             3         6          4              2        7       3
Egg-shaped         1         3          7              3        3       7


round heads, thieves square heads, and embezzlers egg-shaped heads. 
With so few cases, however, we cannot be assured that these distri-
butions are not artifacts of our sampling method. Now suppose we have
a noncriminal sample of the same racial stocks, ages, sex, etc., as the 
criminal sample. We randomly draw three subsamples of the same num- 
ber of cases as we have in the criminal subsamples and obtain the results 
given in the right half of Table 5. Again we see a tendency for each 
head shape to be concentrated in a given subsample. If the subsamples 
really were randomly selected, results like these would offer sound 
evidence for rejecting any functional relationship between head shape 
and type of crime. Actually, [...] criminals and hundreds of noncriminals [...]
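
The weakness of the evidence in Table 5 can be made concrete with a chi-square test of independence, a technique not named in the passage above and introduced here only as an illustration. The sketch computes the Pearson chi-square statistic for each half of the table and compares it with 9.488, the tabled 5 per cent critical value for 4 degrees of freedom. The function name `chi_square` is hypothetical; the counts are those of Table 5.

```python
def chi_square(table):
    """Pearson chi-square statistic for a contingency table (list of rows)."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    statistic = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            # Expected count if head shape and grouping were independent.
            expected = row_totals[i] * col_totals[j] / n
            statistic += (observed - expected) ** 2 / expected
    return statistic


# Rows: round, square, egg-shaped. Columns follow Table 5.
criminal_sample = [[6, 3, 3], [3, 6, 4], [1, 3, 7]]      # extortion, theft, embezzlement
random_subsamples = [[5, 2, 4], [2, 7, 3], [3, 3, 7]]    # subsamples I, II, III

CRITICAL_5_PER_CENT_DF4 = 9.488  # (3 - 1) * (3 - 1) = 4 degrees of freedom

for label, table in [("criminal", criminal_sample), ("random", random_subsamples)]:
    stat = chi_square(table)
    print(label, round(stat, 2), stat > CRITICAL_5_PER_CENT_DF4)
```

Under these assumptions neither half of the table exceeds the critical value, and the randomly drawn subsamples show an apparent pattern nearly as strong as that of the criminal sample. Most expected counts here are below 5, so even the chi-square approximation is doubtful with so few cases, which is a further reason to withhold generalization.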


[...]a in the general activity level of [...]

Errors Due to Failure to Exploit [...] the Data. The Nature of the Error.
This can be described [...]
not drawing a positive generalization when such a generalization is sup-
ported by the findings. He is unwilling to credit the results to his hy- 
pothesis unless he can do so at a very high level of confidence. But in 
doing this he may declare a hypothesis unconfirmed when actually it is 


Fig. 10. Mean activity scores by days of vitamin-B-deficient and normal rats as
measured in activity cages. (Vertical axis: revolutions of cage reduced to
special logarithmic base; horizontal axis: days of observation.)


confirmed. Sometimes this overconservatism is interpreted to mean that
an error is being made on the “safe side.” Certainly, declaring that a 
hypothesis is not substantiated when actually the data confirm it is not 
erring on the safe side. 

To clarify the foregoing arguments it will prove profitable to review
briefly the probabilistic nature of generalization. In earlier sections it was
explained that with the accumulation of favorable evidence hypotheses 
change into laws; that laws are not to be accepted as certainties; and that 
with the accumulation of sufficient negative evidence laws may revert 
to the status of hypotheses. If we accept these interpretations, then the 
point at which a given hypothesis is considered confirmed rests upon the
subjective judgment of the scientist.

Scientists have arbitrarily decided to utilize two confidence levels in 
describing their findings, the 1 per cent level and the 5 per cent level. 
These levels refer to the probability of their results arising from the 
operation of chance or unsystematic factors. Frequently these levels of
confidence are included in the statement of a generalization. This is to
say that an experimenter will often state that “at the 5 per cent level of
confidence we are assured that under M conditions X consequences will
not occur from the operation of unsystematic factors.”

The question may be asked: What level of confidence should [...]
before a generalization is made? The answer depends upon the [...]
or objective of the scientist. In a preliminary study, a 10 per cent level of
confidence often proves sufficient. If the generalization involves [...]
of life and death, then a very conservative attitude is justified, [...]
accepting a generalization only if it is accounted for by chance at the [...]
per cent level. For most scientific problems, the 5 per cent level is
acceptable.


An Example. There are conservative pr[...] scientific psychologists who hesi-
tate to generalize unless they can do so at the 1 per cent level of confi-
dence. Suppose that in a study of racial differences in emotional traits we
use many measures from which we compute a large number of [...]
together with their statistical significance. Suppose further that only [...]
differences are significant at the 5 per cent level or better, 20 significant
between the 10 per cent level and the 5 per cent level, and only 4 dif-
ferences significant below the 10 per cent level. Here is an instance in
which holding to the 1 per cent level of confidence would result in gen-
eralizing that there is very little evidence that real differences in emo-
tional traits exist between the races. In fact, the evidence as presented
strongly favors positive but small differences. The results would support
a generalization to the effect that there were small differences in emo-
tional traits between the racial groups which could not be accounted for
by the operation of chance factors.
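
The reasoning of this example, that many modestly significant differences taken together can outweigh the absence of any single highly significant one, can be sketched with a binomial calculation. The counts below are purely hypothetical and are not those of the study described; the function name `prob_at_least` is likewise an assumption, and the calculation presumes that the separate comparisons are independent, which measures taken on the same persons seldom are.

```python
from math import comb


def prob_at_least(k, n, p):
    """Probability of k or more successes in n independent trials,
    each succeeding with probability p (upper binomial tail)."""
    return sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k, n + 1))


# Hypothetical figures: 100 comparisons, of which 24 reach the
# 10 per cent level of confidence.
n_comparisons = 100
n_significant = 24
level = 0.10

expected_by_chance = n_comparisons * level
tail = prob_at_least(n_significant, n_comparisons, level)

print(f"expected by chance alone: {expected_by_chance:.0f}")
print(f"chance of {n_significant} or more: {tail:.6f}")
```

With these assumed figures, chance alone would be expected to produce about 10 such differences, and obtaining 24 or more would be a rare event. A conclusion drawn only from the differences that individually reach the 1 per cent level would overlook this collective evidence.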
[...] WITH THE CORRESPONDENCE [...] EXPERIMENTAL AND THE
TO-BE-GENERALIZED-TO SITUATIONS


[...]e for a given generalization [...] because we make mistakes of
judgment concerning the nature and significance of the similarities be-
tween the experimental and the to-be-generalized-to situations.

Errors Due to the Use of Nondeterminant Similarities. The Nature of
[...] be attributed to certain factors [...] to these factors there will be
[...]ally present, do not bear directly on the [...] the results. It is
possible for an investigator to confuse
these two types of factors. It is possible for him to attribute the results
to the nondeterminant factors, which he, of course, believes are func- 
tioning as determinants. When this mistake has been made, errors in 
generalization are likely to occur. Obviously, if the to-be-generalized-to 
situation contains these factors but does not contain the factors that 
actually functioned as determiners in the original experiment, the gen- 
eralization cannot be valid. 

An Example. To illustrate this error, let us refer to certain generaliza- 
tions of those who call themselves characterologists. Among the impli- 
cations stemming from the relationship between personality and bodily 
mechanisms, which were listed earlier, body chemistry was mentioned as 
a possible determinant of personality responses. In one system of charac- 
terology the relation between body chemistry and personality is reduced 
to what is called the law of color. This law refers to the difference in 
skin pigmentation of persons of the same race, such as the difference 
between blonds and brunets. The law holds that the more pigment pos-
sessed by a given person, the more markedly will he possess the charac-
teristics of the brunet in his physical and mental constitutions. The fun- 
damental assumption is made that there are distinct personality dif- 
ferences between blonds and brunets. However, the results of several 
experimental studies show this assumption to be false. 

The error with which we are here concerned is that of treating the
pigment factor as a determinant of personality. Granting the false as-
sumption that there are significant differences in personality traits be-
tween these two groups, we are still faced with the problem of finding
the possible determinants of such differences. The fact that our stereo-
typed thinking associates certain traits with blond-complexioned persons 
and certain other traits with brunet-complexioned persons is not in itself 
evidence that pigment plays a determinant role. This kind of fallacious 
reasoning underlies the logic of all of the systems of characterology 
in which anatomical signs are first associated with personality responses 
and then thereafter are considered the determinants of these responses. 

Errors Due to Noncorrespondence in the Characteristics of Populations. 

The Nature of the Error. Error may result when a generalization based
on a hypothetical population is extended to a particular real population.
It was pointed out earlier that when a scientific psychologist is interested
in learning about the nature of some psychological process as a phe-
nomenon distinct from the individuals manifesting it, he may not concern
himself with the nature or characteristics of the sample of subjects he
uses. This procedure is justified and is scientifically sound. The subjects
are considered a sample of some hypothetical population. If the findings
from studying these subjects are favorable, then the hypothesis is con-
sidered substantiated. Any generalization made is concerned with the
nature of the psychological process investigated rather than with the
nature of the individuals used as subjects. It is to be understood, of
course, that the characteristics of the particular individuals studied may 
have contributed in some way to the findings obtained. Error becomes 
possible when the generalization is applied to some particular group of 
persons differing from the experimental subjects. 

Sometimes it is important to apply a generalization evolved in this 
way to a particular group of individuals. It may happen that the psycho- 
logical process investigated is an important characteristic of certain 
classes of persons and that the findings concerning the psychological 
[...] to these individuals if they were ap[...] must be broadened to
include the [...] psychological process probably should [...] groups of
individuals. It is important [...] learning about the most [...] are
teachers in a boys' [...] to 12-year-old students in [...]
the new population probably had the characteristics that were essential
to the results obtained in our experiment. 

Errors Due to Noncorrespondence between Groups and Individuals. 
The Nature of the Error. Generalizations about psychological processes 
based on a study of a group are not applicable in every detail to new 
individuals. When psychological processes are being studied, it is almost 
impossible to form a group in which every individual possesses every 
determinant underlying the psychological process under investigation. 
Describing a psychological process of a group, of course, is done by 
studying the individuals comprising the group, but the generalizations 
formulated encompass those aspects that are found to characterize the 
group as a unit. For a generalization based on a group to hold for an 
individual it is necessary that the individual possess a minimum number 
of the determinant characteristics discovered in the group. Just what 
these characteristics must be will depend upon the nature of the gen- 
eralization that is made. 

Generalizations about specific behaviors to be expected in any new 
individual are subject to gross errors. Generalizations that are restricted 
to expectations of probable tendencies in the new individual have a 
higher likelihood of being valid. 

An Example. This error is often made when we make predictions about
students' performances in college from a knowledge of their
intelligence-test scores. It has been amply demonstrated in studies of
groups that there is a high correlation between intelligence-test scores
and college success. The average success in college has been computed
for various levels of intelligence, and it has always been found that the
level of success increases with increase in the level of intelligence.
This is merely a problem in statistical computation. Knowing a given
student's intelligence level, however, does not enable us to make a
precise prediction about his future success in college. This is to say
that we cannot ascribe to him any precise level of college achievement.
We must state our generalization in terms of probability. We can say
that he has a high chance of making a certain achievement level or a low
chance of making some other achievement level, but we cannot stipulate
the precise level he will achieve. This is because we are uncertain of
the extent to which the determinants of college success operating in the
group originally studied will be operating in the given individual. Our
generalizations must then be probabilistic in nature.
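
The gap between a group trend and a prediction for one student can be put
in numbers. The sketch below is illustrative only: it assumes standardized
scores, a linear bivariate-normal relation, and a hypothetical correlation
of 0.50 between test score and college achievement. None of these figures
comes from the studies referred to above, and the function names are
invented for the illustration.

```python
import math

def normal_cdf(z):
    """Cumulative probability of the standard normal distribution."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))

def predict_individual(z_test, r):
    """Return (expected z, standard error) of achievement for one student.

    Assumes standardized scores and a linear, bivariate-normal relation
    with correlation r (both are assumptions of this sketch).
    """
    expected = r * z_test                 # regression toward the mean
    std_error = math.sqrt(1.0 - r * r)    # spread around the expectation
    return expected, std_error

def chance_of_exceeding(z_test, r, z_cutoff):
    """Probability that the student's achievement exceeds z_cutoff."""
    expected, std_error = predict_individual(z_test, r)
    return 1.0 - normal_cdf((z_cutoff - expected) / std_error)

R_HYPOTHETICAL = 0.50   # assumed correlation, for illustration only
expected, std_error = predict_individual(1.0, R_HYPOTHETICAL)
p_above_average = chance_of_exceeding(1.0, R_HYPOTHETICAL, 0.0)
print(f"expected z = {expected:.2f}, standard error = {std_error:.2f}")
print(f"chance of above-average achievement = {p_above_average:.2f}")
```

Under these assumptions a student one standard deviation above the mean
on the test has a better-than-even chance of above-average achievement,
but the standard error remains large, which is why only a statement of
probability can be made about him.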

Errors Due to Generalizing about Elements or Relationships Not
Empirically Tested. The Nature of the Error. This is a form of
overgeneralization. Having verified a theorem and thus obtained evidence
substantiating his hypothesis, the investigator may be inclined to
broaden his generalization to include elements and relationships that
were not in[...]. This usually leads to error. [...] confirms all of the
elements and relationships [...]. The breadth of any hypothesis is
reduced as we develop its implications and theorems and devise [...]
supply empirical knowledge. Seldom can a theorem be formulated that will
encompass all of the implications of a hypothesis. [...]tion of the
hypothesis is necessary to bring it within the limits of a [...]able
theorem. There are, then, elements and relationships of the hypothesis
that are removed from consideration in this process of [...] empirical
test. Of course, if these omitted elements and relationships are closely
related to those that are tested, a generalization can be [...] about
them, provided that it is presented as a tentative suggestion with the
full expectation of checking it empirically with additional tests.

[...] a practical application to individual [...] especially if the new
elements and relationships [...] from those originally investigated.
Additional [...]

[...] size, color, amount of copy, illustrative material, amount [...]
studied the effect of variations in these different characteristics on the
attentive and memory value of advertisements. Some investigators became
overly enthusiastic [...] and generalized beyond their data to the
practical sales situation. Some of them, albeit not the most reputable
investigators, advised business and advertising firms on how to construct
advertisements to achieve the greatest attraction [...] contrasted with
the specific mental set of the experimental student in at-
tending to the advertisements; the appearance of advertisements among 
other reading material in the layman’s experience as contrasted with the 
presentation to the students of made-up booklets composed solely of 
advertising matter; the many variable ways in which the layman is ex- 
posed to advertising as contrasted with the artificial and restricted 
exposure given the students; the buying interests of the lay public as 
contrasted with the students’ interest in following the instructions of the 
experimenter; the many ways in which the potential buying public differs 
in personal characteristics from a group of college students engaging in 
an experiment in psychology. It should be apparent that many of the 
factors that operate to determine a lay buyer’s reaction to an advertise- 
ment cannot be duplicated in the experimental laboratory. Any generali- 
zation from a laboratory experiment to a practical advertising situation 
would then be extended to elements and relationships that actually could 
not have been empirically tested in the experimental study.


PART THREE


Some Individual Scientific Procedures 


In Part Two the separate steps involved in conducting a scientific 
study were discussed. We learned that the scientist first develops a prob- 
lem, framing it in the form of a hypothesis. Next he derives a theorem 
that represents the elements of the hypothesis and sets up an empirical 
situation in which the theorem can be tested. In the empirical test, he 
uses procedures that will provide facts that are pertinent to the theorem 
and that can be subjected to logical and statistical analyses. Lastly, he 
draws inferences in the form of conclusions or generalizations which
project beyond the limited boundaries of his own study but which are
directly supported by the facts he has collected. These are the major
steps of the scientific method.

We are now ready to consider some of the more specialized procedures 
that the psychologist uses in his research studies. Some of these proce- 
dures can be subsumed under the term experimental because they are 
primarily concerned with the production, arrangement, ordering, and
recording of the expressions of the variables being studied. Some of them
can be classified as measurement procedures because they are primarily
concerned with the quantifying of these expressions. A hard-and-fast
line should not be drawn between these two categories. Experimental and
measurement procedures are complementary in function, and in nearly
every study they must be used together in order to accomplish the desired
ends. A very large number of specialized experimental and measurement
procedures have been devised. Only a few of the major ones can be
considered.

In Chap. 12, some of the procedures devised to control factors in the
physical stimulus situation are described. A very common problem in
psychological experimentation concerns the relationships existing
between the characteristics of the physical stimulus and the nature of the
ensuing psychological experience. Many specialized procedures have
been developed to control and manipulate physical stimulus factors in
order that accurate comparisons between stimuli can be made. To this
end, much attention has had to be given to ways by which equality
[...] can be achieved and maintained. Another recurring problem
in psychological research is that of controlling interfering factors
arising within the experimental procedures themselves. Not only does
a subject react to the particular stimuli the experimenter presents; he
is also affected by the timing, ordering, spatial arranging, etc., of the
stimuli. These factors must be kept constant unless they are part of the
experimental variable under investigation.

Experimental procedures connected with factors in the responding
subject are discussed in Chap. 13. The behavior of a subject can be
interpreted as reflecting fundamental psychological predispositions.
These predispositions are defined broadly as tendencies to respond in
certain ways, and can be classified as interests, attitudes, abilities,
and past experiences. Predispositions function not only as experimental
variables to be investigated but also as unwanted interfering variables.
Controlling them demands the use of many specialized procedures.

Because of the complexity of human behavior, a large number of pro-
cedures are needed for quantifying the different characteristics of a
response. In Chap. 14, the following major methods used for measuring
psychological variables are described: frequency of occurrence, highly
structured tests, inventories and questionnaires, unstructured stimulus
situations, ratings, and interview procedures.

Coincident with the development of valid experimental and measurement
procedures the psychologist has broadened his interests to include
the investigation of behavior phenomena not found in highly controlled
laboratory situations. Examples of several different kinds of field-type
studies are presented in Chap. 15.


CHAPTER 12


Physical Variables in Psychological Experiments 


The problems to be discussed in this and the next chapter are not re- 
stricted to those that might occur only in a psychological laboratory. A 
scientific experiment contains essentially the requirements we have dis-
cussed in preceding chapters, namely, a working theorem, a procedure
for collecting data, logical and statistical analyses of the data, and gen- 
eralizations evolved from these analyses. These requirements can be 
met in many situations outside of the so-called experimental laboratory. 
In the following discussions of experimental procedures no attempt is 
made to distinguish them in terms of their being used either in or out 
of the psychological laboratory. 

In the present chapter, we shall discuss methodology as it is condi- 
tioned by factors associated with the physical stimulating situation. By 
manipulating physical factors in the experimental test the scientist can
achieve control over many of the variables he wishes to study. In the 
following chapter, we shall consider methodology as it is conditioned by 
factors residing in the responding subject. 


VARIABLES OF AN EXPERIMENT 


Before discussing some of the special procedures used by the psychol-
ogist for experimenting with behavior, we should have in mind a gener-
alized picture of the variables that are likely to be functioning in any
experimental study.

The Logic of an Experiment. The primary purpose of an experiment
is to collect factual evidence pertinent to a theorem. Obviously, this evi-
dence is going to be found in some form of human or animal behavior.
It is then necessary that there be an empirical situation in which
behavior can be observed and, when possible, registered in some form of
permanent record.

In order to be useful the behavior must supply evidence that is related
to the theorem. In any situation in which humans or animals are
responding, behavior determiners that are not a necessary part of the
testing situation will always be found; that is, there will be behavior
that is not required as evidence for the theorem. Some of this behavior
will be pertinent to the theorem, however, and therefore it must be
considered in the experimental planning. Some of the behavior has
no significance for the theorem. The investigator must distinguish among
these several kinds of behavior in order that he can evolve the best
possible case for his theorem.

The different kinds of behavior usually can be distinguished by relating
the responses to the several sources of stimuli that are operating.
Some stimulus conditions are purposely established by the investigator
to elicit expressions of the variables that are selected for testing the
theorem. Usually there are other stimulus conditions present that are
pertinent to the theorem but are not part of the variables selected for
the empirical test. It is important that the effects of these latter
conditions are not overlooked in the evaluation.

Objectives of the Experimental Design. If an empirical test is to
furnish evidence pertinent to a theorem, it is necessary that the
experimental conditions be designed to meet the logical relationships
that are demanded. This is accomplished through several general
objectives.

The experimental procedures must be designed to elicit and control
the expression of the variables through which the conditions of the the-
orem are represented. We have called these the experimental variables.

The experimental procedures must be designed to control the expression
of all other variables that are operative in the empirical situation.
We have called these variables unwanted systematic variables and
unsystematic variables.

The [...] register all behavior that may either directly [...] pertain
to the testing of the theorem [...] that make a faithful and [...]

The foregoing statements express the ideal experimental situation
toward which we should strive. We may not always be able fully to 
attain each of the objectives outlined, but to the extent that we are suc- 
cessful, to that extent will we accumulate sound experimental evidence 
that can be applied to the hypothesis we are studying. 

Major Classes of Determinants in a Psychological Experiment. The 
determinant variables of an experiment can be distinguished in terms 
of their pertinence or nonpertinence to the theorem, as described in the 
preceding sections. They can also be distinguished as being in the physi- 
cal world or being within the behaving organism. Determinants in the 
organism can be further distinguished as being primarily physiological
or being primarily psychological in nature. We here need merely to men-
tion these distinctions, leaving further clarification for later sections.

Let us briefly consider the classes of variables that can be manipulated
in order to elicit the behavior pertinent to a theorem. One way we can
purposely elicit a desired change in behavior is by manipulating the
physical-stimulus situation. By changing the quality, intensity, amount,
location, etc., of objective stimuli, we can elicit change in the experi-
ence and overt behavior of the subject being stimulated. Similarly, by
changing the stimulus situation internal to the organism we can elicit
variations in the reacting subject. Much less precision is usually attained
in controlling this type of stimulation than in controlling external physi-
cal stimuli. Behavior change pertinent to a theorem may also be pro-
duced by manipulation of the physical mechanisms of response. In gen-
eral, all behavior can be referred to changes in sense organs, nervous
system, endocrine glands, and muscles. By purposely changing the func-
tioning condition of one or more of these mechanisms, it is possible to
produce variations in behavior. Finally, there are many psychological
determiners of behavior, such as attitudes, abilities, skills, past experi-
ences, and the like, that can be manipulated as experimental variables in
the testing of theorems. Changes in these variables cannot be accom-
plished with the directness possible in the manipulation of physical-
stimulus objects, but controlled variation can be achieved with sufficient
precision to make these psychological predispositions a very fertile area
for studying behavior change.

As already stated, we must design procedures to control and record
variables operating coincidentally with those purposely associated with
the theorem being tested. Although these coincident variables are not
expected to produce differential effects, nevertheless they may do so.
Inasmuch as they are either real or potential sources of behavior change,
they must be treated with the same thoroughness as is applied to the
experimental variables. Control of these coincident variables must be
achieved in the same ways as were described for the experimental
variables; namely, by manipulation of the external stimulating situation,
the internal stimulating situation, the physiological mechanisms of
response, and the predisposing factors of a psychological nature. In
dealing with these coincident variables, the general purpose is either to
remove them entirely from the empirical testing situation or to keep
their effects constant in all phases of the experiment.


THE CORRESPONDENCE BETWEEN PHYSICAL-STIMULUS 
FACTORS AND PSYCHOLOGICAL-RESPONSE FACTORS 


We should not interpret the relationship between the objective stimu-
lus and the subjective experience or response as necessarily being simple
in nature. Rather, this relationship varies through a very wide range,
from the simple relations found in some experiments on sensory phe-
nomena to the very complex and relatively little understood relations
encountered in experiments on the higher mental processes.

Simple Relationships in the Area of Sensory Phenomena. By restricting
the change in the physical stimulus to one attribute and registering be-
havior change in terms only of the changes in the experience of the sub-
ject, a simple relation between the stimulus and the response is often
encountered. Experiments in vision, audition, touch, and kinesthesis have
yielded such simple relationships.

The relationship between the intensity of light and the experience of
brightness is a case in point. When the intensity of the stimulus light is
low, we experience a low degree of brightness. With increasing intensity
we experience an increasing brightness, until a dazzling brightness is
experienced which we refuse to endure.

Similarly, in the field of hearing [...]tion in the stimulating object
[...] loudness until we reach a level [...] one attribute, the
corresponding sensory experience may be altered radically by changes
within this attribute.

Relationships in Simple Space-perception Situations. When the sub-
ject is called upon to make a judgment about certain spatial character-
istics of environmental objects, there may be a very low degree of corre-
spondence between the physical stimulus change and the subject's
responses.

This can be illustrated with judgments of distance in visual space.
It is a well-known physiological fact that the size of the retinal image of 
an object becomes smaller as the distance of the object from the eye is 
increased. A given object located far away produces a small retinal 
image; this object located close by produces a large retinal image. We 
would expect, then, that our experience of the distance of objects would 
follow closely this relationship and that in judging distance we would 
always see the object with the larger retinal image closer than the object 
with the smaller retinal image. Another factor enters, however, which is 
our knowledge of the “natural” sizes of objects—the average sizes of 
objects when they are perceived equally distant away. We know that a 
horse is larger than a man, so the retinal image of a horse is on the average
larger than the retinal image of a man. When we estimate the relative 
distance away of a horse and a man, this knowledge somehow influences 
our judgment. A horse located only a small distance farther away than 
a man will produce a larger retinal image than the man. If we were using 
simply the relative sizes of the retinal images, the horse would be judged 
closer than the man, but in spite of the fact that the retinal image of the 
man is smaller than that of the horse, the man is judged to be closer.
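
The geometry behind this example can be checked with a short calculation.
The sketch below uses the visual angle subtended by an object as a
stand-in for the size of its retinal image. The sizes and distances are
hypothetical figures chosen for illustration, not measurements from any
experiment.

```python
import math

def visual_angle_deg(size, distance):
    """Angle (degrees) subtended at the eye by an object of a given size.

    Retinal-image size grows with this angle, so it serves here as a
    proxy for the size of the retinal image.
    """
    return math.degrees(2.0 * math.atan(size / (2.0 * distance)))

# Hypothetical values: a man 1.8 m tall at 20 m, a horse 2.4 m long at 22 m.
man_angle = visual_angle_deg(1.8, 20.0)
horse_angle = visual_angle_deg(2.4, 22.0)

print(f"man:   {man_angle:.2f} degrees")
print(f"horse: {horse_angle:.2f} degrees")
# The horse is farther away yet subtends the larger angle, so image size
# alone would wrongly place it nearer than the man.
print("horse image larger:", horse_angle > man_angle)
```

With these figures the farther object yields the larger image, so a
correct judgment of relative distance cannot rest on image size alone.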

In the area of visual illusions we find many examples of lack of cor-
respondence between the subject’s experience and the variation in the
physical-stimulus object. For example, when we look down a railroad
track we observe the rails coming together at a point near the horizon.
Here is an instance in which the form of the object as experienced dif-
fers from that which we know to be true in the objective world. The
experimental psychologist has subjected various kinds of illusory figures to
analysis in an attempt to find out what aspects of the stimulus are respon-
sible for arousing the illusory experience. The pattern of the lines com-
prising the figure has been found important in many cases, but the direc-
tion of the observer's attention, his eye movements, and his mental sets
and attitudes also play determining roles in the response.

“New” Sensory Experiences without Counterparts in the Physical
Stimulus. Sometimes when we continue changing the physical stimulus
in what appears to be a simple progression, an experience may occur
that does not have a counterpart in any physical characteristic of the
stimulus. This can best be illustrated in the area of hearing, with what
are called intertones and difference tones.

If two pure tone stimuli, equal in intensity, are varied in vibration
rate, several changes in experience may occur. When the vibration rates
of the two sounds differ by only a few cycles per second, only one tone
is heard. It is the intertone, and it corresponds to a tone that in terms of
physical characteristics would have a vibration rate intermediate be-
tween the vibration rates of the two sounds. As the difference between the
vibration rates increases, the intertone gradually fades out and the two
primary tones are heard.

When the two sound stimuli vary in frequency by more than [...]
or 50 cycles per second, a third tone is heard which has a pitch cor-
responding in vibration rate to the difference between the rates of the
primary vibrations. This is the first difference tone. A difference tone is a
subjective tone arising from mechanisms within the ear. It has no coun-
terpart in the vibrations of the stimulating objects. If the subject is pro-
vided with a tone variator, which enables him to change the pitch of
tones, he can produce a tone from a physical source that matches the
difference tone in pitch. This tone has a vibration rate that equals the
difference between the rates of the two stimuli from which the difference
tone originates. Several difference tones can be made audible through the
use of a resonator or amplifier.
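
The arithmetic of the intertone and the first difference tone can be
written out directly. In the sketch below the function names and the
example frequencies are illustrative assumptions. The intertone is
approximated as the midpoint of the two rates, which is only one simple
reading of "intermediate."

```python
def intertone_rate(f1, f2):
    """Approximate rate of the intertone as the midpoint of the two rates.

    The midpoint is an assumption of this sketch; the text says only that
    the rate is intermediate between those of the two stimuli.
    """
    return (f1 + f2) / 2.0

def first_difference_tone(f1, f2):
    """Rate of the first difference tone: the difference of the two rates."""
    return abs(f1 - f2)

# Hypothetical stimuli, in cycles per second.
close_pair = (440.0, 443.0)    # a few cycles apart: one intertone is heard
wide_pair = (1000.0, 1200.0)   # far apart: a difference tone is heard

print("intertone near", intertone_rate(*close_pair), "cycles per second")
print("difference tone at", first_difference_tone(*wide_pair), "cycles per second")
```

A subject using a tone variator would be expected to match the difference
tone by setting the physical source near the second value printed.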

Relationships in More Complex Behavior Situations. The relationship
between the physical stimulus and the consequent behavior of the sub-
ject becomes increasingly complex as the stimulus becomes more sym-
bolic in function and as the number of responses potentially associated
with the stimulus becomes greater. As the symbolic function of the
stimulus increases in importance, there is a greater variety of ways in
which the experiences, attitudes, interests, abilities, etc., unique to an
individual can contribute to the response that he makes to the stimulus.
The effect a stimulus will have is then less accurately predicted from a
knowledge of the physical dimensions through which it might be varied.
The free word-association test, although a rather simple form of sym-
bolic stimulus, presents a situation that results in a wide variation in the
responses elicited from different subjects. In this test the experimenter
[...] subject replies with the first word that comes [...]

Response    Freq.    Response    Freq.    Response    Freq.    Response    Freq.
black       308      cloth       17       almost      [?]      [?]         [?]
color       170      paper       17       body        1        lovely      [?]
snow        91       colorless   11       cherries    1        napkin      [?]
light       [?]      clean       [?]      [?]         1        pretty      [?]
dark        35       blue        [?]      good        1        [?]         [?]
dress       34       milk        9        hard        [?]      [?]         [?]
pure        20       red         [?]      innocence   1        swan        [?]
purity      19       green       6        lady        1        trousers    [?]


STIMULUS COMPARISON


The Problem. One of the first problems investigated by experimental 
psychologists concerned the functional relationship existing between 
various aspects of the physical stimulus object and the resulting experi- 
ences reported by the subject. A wide range of stimulus characteristics 
in the several modalities of sense was studied. 

The primary problem was to quantify the relationship existing be- 
tween a given stimulus characteristic, e.g., intensity of vibration of a 
sound stimulus, and the resulting reported attribute of experience, e.g., 
loudness of the sound. This presented a problem of measurement in 
respect to both the stimulus and the response. Quantification of the 
stimulus characteristic was readily attained for most sensory stimuli, 
especially in vision and hearing, because of the ease with which the 
characteristics of objective stimuli could be represented in terms of 
physical measuring units. There was no obvious corresponding way to 
measure changes in experience, and so the establishment of some scale 
of measurement for the sensory experience occupied the attention of many 
of the early investigators. 

From the work of these investigators were developed several important
experimental procedures, particularly those that are known as the psycho-
physical methods. These were evolved in the study of problems in sen-
sation and perception, such as the absolute threshold (the minimal
amount of a stimulus that can be detected) and the difference threshold
(the minimal stimulus difference that can be detected). Here there are
two series of values, that of the physical stimuli and that of the subject's
judgments. Additional procedures were evolved in areas in which there
is no quantitative scale of physical stimuli, as in measuring the subject's
preferences. It is obvious that judgments of the subject can be obtained
in which he indicates that he prefers one stimulus rather than another,
e.g., one color rather than another, or one painting rather than another.
There is no way of arranging the stimuli on a common physical scale,
but it is possible through the judgments of the subject to arrange the
stimuli on a scale according to his preferences. These procedures are
called psychological scaling methods.

Space does not permit describing all of the psychophysical and psy-
chological scaling methods. Certain of the standard and frequently used
procedures will be briefly described and illustrated.

The Method of Constant Stimulus Differences. As the name implies, 
this method is used to measure differences between stimuli. A limited 
number of stimuli are used, one standard and usually from four to seven 
comparison stimuli. Pairs of stimuli (the standard and one comparison
stimulus) are presented to the subject, usually in a random order. The
subject makes a judgment each time a pair is presented, stating
whether one stimulus is either greater than, equal to, or less than
the other. Suppose we are interested in measuring the difference threshold
for linear extension, that is, the amount that must be added to or subtracted
from a given length in order for it to be seen as different in length. A
simple device for presenting the problem to the subject is to use black
lines on strips of white cardboard. A standard stimulus card is prepared
with a line of a given length, say 12 inches. Other comparison cards are
prepared, say seven, one of which is equal to the standard, while the
others differ from it, three in which the length is longer and three in
which the length is shorter than the standard. The lengths of these longer
or shorter lines are not simply guessed but are worked out in preliminary
experiments. They must differ from the standard without any given line
being of such a length that the subject will judge it as always longer or
always shorter than the standard line.

The experimental procedure requires the standard card to be paired
with each of the comparison cards and these pairs presented to the sub-
ject a large number of times in a random order. The subject is asked to
state whether the standard stimulus is longer than, equal to, or shorter
than the comparison stimulus. The per cent of times the standard is
judged longer (or shorter) than each comparison stimulus is determined.
The thresholds are then statistically computed. The upper threshold is
that difference in length that yields a judgment of "longer" 50 per cent
of the time. Similarly, the lower threshold is that difference that yields
a judgment of "shorter" 50 per cent of the time.
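The 50 per cent points described above can be located in several ways; the text does not say which statistical computation is intended. The following is only a minimal sketch that assumes simple linear interpolation between adjacent comparison values. The differences (standard minus comparison, in inches) and the proportions are hypothetical figures invented for illustration, and the function name is likewise an assumption.

```python
def interpolate_threshold(differences, proportions, target=0.5):
    """Linearly interpolate the stimulus difference at which the
    proportion of a given judgment equals `target`."""
    points = sorted(zip(differences, proportions))
    for (d0, p0), (d1, p1) in zip(points, points[1:]):
        if p0 == p1:
            continue  # flat segment, nothing to interpolate
        if min(p0, p1) <= target <= max(p0, p1):
            return d0 + (target - p0) * (d1 - d0) / (p1 - p0)
    raise ValueError("target proportion is not bracketed by the data")

# Hypothetical data: standard minus comparison length, in inches.
differences = [-0.3, -0.2, -0.1, 0.0, 0.1, 0.2, 0.3]
# Proportion of trials on which the standard was judged "longer".
p_longer = [0.02, 0.05, 0.15, 0.30, 0.45, 0.70, 0.90]
# Proportion of trials on which the standard was judged "shorter".
p_shorter = [0.85, 0.65, 0.40, 0.25, 0.10, 0.04, 0.01]

upper = interpolate_threshold(differences, p_longer)   # about 0.12 inch
lower = interpolate_threshold(differences, p_shorter)  # about -0.14 inch
print(round(upper, 3), round(lower, 3))
```

With these invented figures the upper threshold falls near 0.12 inch and the lower near 0.14 inch below the standard. Other fitting procedures would give somewhat different values.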


The Method of Minimal Changes. This is also known as the method
of limits. In this method, a series of stimuli is presented to the subject,
each stimulus differing by a slight amount from the preceding one, the
series being continued until a critical change occurs in the subject's judg-
ment. The experimenter manipulates the stimulus. Sometimes in the series
the stimuli ascend (increase) in value, and sometimes they descend
(decrease) in value.


This method is admirably suited for determining the absolute threshold.
Suppose the absolute threshold for intensity of sound is desired. A tone
of simple frequency is selected and the necessary electrical control cir-
cuits devised so the tone can be gradually increased and gradually de-
creased in intensity. In the ascending series the tone is set at an intensity
well below the threshold and gradually increased over successive trials
until the subject detects the tone. In the descending series the tone is set
at an intensity well above the threshold and decreased through a series of
trials until the subject says he can no longer hear the tone. A threshold
value can be computed for each series. The best estimate of the general
threshold value is computed by averaging the threshold values of all of
the individual series.
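The averaging of series thresholds can be sketched as follows. The text does not state how the threshold of a single series is computed; this sketch assumes the midpoint between the last intensity before the subject's report changes and the first intensity after it. The intensity values and responses are hypothetical.

```python
from statistics import mean

def series_threshold(intensities, heard):
    """Midpoint between the two successive intensities at which the
    subject's report changes (an assumed convention)."""
    trials = list(zip(intensities, heard))
    for (i0, h0), (i1, h1) in zip(trials, trials[1:]):
        if h0 != h1:
            return (i0 + i1) / 2
    raise ValueError("the report never changed within this series")

# Hypothetical series: (intensities in arbitrary units, tone heard?)
series = [
    ([0, 2, 4, 6, 8], [False, False, False, True, True]),    # ascending
    ([12, 10, 8, 6, 4], [True, True, True, False, False]),   # descending
    ([1, 3, 5, 7], [False, False, True, True]),              # ascending
    ([11, 9, 7, 5, 3], [True, True, True, True, False]),     # descending
]

thresholds = [series_threshold(i, h) for i, h in series]
general_threshold = mean(thresholds)
print(thresholds, general_threshold)
```

Using equal numbers of ascending and descending series, as here, keeps either direction from dominating the average.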

The Method of Average Error. This method is also called the method 
of reproduction, because the subject manipulates the comparison stim- 
ulus. He is presented with a fixed standard and an adjustable comparison 
stimulus. His task is to manipulate the comparison stimulus until it ap- 
pears equal to the standard. Sometimes the comparison is set at a value 
below that of the standard and the subject has to increase it to reach the
point of equality. At other times the value of the comparison stimulus
is greater than that of the standard and the adjustment is in the direction
of decreasing the size of the comparison stimulus.

Fig. 11. Diagram of subject's control adjustment in Müller-Lyer Illusion Experiment.

This method can be used to measure the influence of extraneous factors
upon human judgment in those situations in which a judgment of sub-
jective equality can be made. An example is a study of the Müller-Lyer
illusion, as seen in Fig. 11. The apparatus is constructed so the line lengths
between the points of the arrows can be adjusted both by the experi-
menter and by the subject. If only one-half the illusion is to be adjusted
in size by the subject, the other half of the figure is stabilized. It will
be noted in Fig. 11 that the adjusting rope is made in the form of a
complete loop, enabling the subject to move the adjustable portion of the
arrow either to the left or to the right. For a given trial the experimenter
makes the length of the adjustable arrow different from the stationary
one and directs the subject to manipulate the adjustable line until he
judges the two parts of the illusion as equal. Some of the factors that
may be investigated in regard to their relation to the judgment of equality
are the length of the arrows, the length of the arrow tips, the angle of
the arrow tips, the space error (adjustment made only on the right or
only on the left), the movement error (adjustment in only a decreasing
direction or in only an increasing direction), and the time error (variation
in the time given for making the adjustment).
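The text does not give the computations used with this method. As a hedged illustration, the sketch below follows one common convention: the constant error is the mean setting minus the standard, and the average error is the mean absolute deviation of the settings from the standard. The standard length and the subject's settings are hypothetical, as are the function names.

```python
from statistics import mean

standard = 100.0  # hypothetical length of the standard, in millimeters

# Hypothetical settings made by the subject, in millimeters.
ascending_settings = [96, 97, 95, 98]       # comparison started too short
descending_settings = [101, 99, 100, 102]   # comparison started too long

def constant_error(settings, standard):
    """Mean setting minus the standard (signed)."""
    return mean(settings) - standard

def average_error(settings, standard):
    """Mean absolute deviation of the settings from the standard."""
    return mean([abs(s - standard) for s in settings])

all_settings = ascending_settings + descending_settings
ce = constant_error(all_settings, standard)
ae = average_error(all_settings, standard)
# Difference between the two directions of adjustment, which gives a
# rough indication of the movement error mentioned in the text.
movement_effect = mean(descending_settings) - mean(ascending_settings)
print(ce, ae, movement_effect)
```

A negative constant error in this sketch means that the subject, on the average, set the comparison shorter than the standard.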

The Method of Equal-appearing Intervals. Instead of asking the sub-
ject to give a judgment of present or absent (absolute-threshold experi-
ments), or of same or different (difference-threshold experiments), we
may wish him to judge the distances between stimuli. We can ask him
to bisect a tonal interval, e.g., find a stimulus tone the loudness of which 
falls midway between that of two others; or to bisect the distance between 
two gray values, e.g., select a gray that falls midway between two other 
grays. The method varies in detail according to the nature of the prob- 
lem; e.g., the experimenter manipulates the stimuli in some problems, the 
subject in others. 

An adaptation of this method has been successfully applied in the 
scaling of propositions that are designed to reflect differences in attitude 
toward some social problem. Statements of opinion are collected on some 
fundamental issue, e.g., attitude toward war. They vary in all degrees
from being extremely favorable to extremely unfavorable to the issue.
About a hundred statements representing points along the whole con-
tinuum are selected for scaling. These statements are given to a hundred
or so judges to sort into categories that appear equally separated along a
scale. Usually eleven categories are used. The middle neutral category
and the extreme favorable and the extreme unfavorable categories serve
as standards or anchor points. Several statistical procedures are applied 
for selecting statements that have high discriminating power and fall 
at approximately equal points along the continuum from one extreme to 
the other. Such statements can be formed into an attitude scale for 
measuring either individual or group attitudes towards the social issue 
involved. Below are several statements from an attitude scale on war de- 
vised according to the method of equal-appearing intervals. 


Compulsory military training in all countries should be reduced but
not eliminated.


The benefits of war outweigh its attendant evils. 
He who refuses to fight is a true hero. 
An organization of all nati
pairs. Each pair is presented separately to the subject, who designates the
member of the pair that he prefers. The number of pairs is much
larger than the number of individual stimuli (number = n(n - 1)/2).
The amount of work required of the subject becomes excessive when 
about 15 or more stimuli are to be judged, and special short-cut pro- 
cedures have been devised for reducing it. Various statistical formulas 
are available for determining the scale separations of the stimuli forming 
the continuum. 

One study in which the method of pair comparison was used was in 
determining preferences for national groups. The name of each of nine 
national groups was paired with that of every other group. The subjects 
were shown each of the 36 pairs of names and asked to state a prefer-
ence. By the pooling of the responses of a large number of subjects, pro-
portions were computed indicating how frequently each national group
was preferred to each other group. Statistical procedures were used to
place the several groups on a common continuum of preference as de-
termined from the judgments of these particular subjects. In this study 
the national groups, as ranked from the most preferred to the least pre- 
ferred, were scaled in the following order: English, Scotch, French, 
Swedish, Italian, Russian, Greek, Mexican, Turkish. 
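The count of 36 pairs follows from the formula n(n - 1)/2 with n = 9. A short sketch of forming the pairs and pooling the choices into proportions is given here. The group names are those listed in the text; the pooled responses are hypothetical and use the placeholder labels "X" and "Y", since the original data are not reported. The scaling step itself is not shown.

```python
from collections import Counter
from itertools import combinations

groups = ["English", "Scotch", "French", "Swedish", "Italian",
          "Russian", "Greek", "Mexican", "Turkish"]

def all_pairs(stimuli):
    """Every stimulus paired once with every other stimulus."""
    return list(combinations(stimuli, 2))

def preference_proportions(choices):
    """choices: (preferred, other) tuples pooled over subjects.
    Returns, for each ordered pair (a, b), the proportion of
    presentations of that pair on which a was preferred to b."""
    wins = Counter(choices)
    proportions = {}
    for (a, b), n_ab in wins.items():
        total = n_ab + wins.get((b, a), 0)
        proportions[(a, b)] = n_ab / total
    return proportions

pairs = all_pairs(groups)
n = len(groups)
print(len(pairs), n * (n - 1) // 2)

# Hypothetical pooled choices for a single pair of stimuli.
choices = [("X", "Y")] * 30 + [("Y", "X")] * 10
proportions = preference_proportions(choices)
print(proportions[("X", "Y")])
```

Because the number of pairs grows roughly as the square of n, fifteen stimuli already require 105 judgments, which is consistent with the remark that the subject's work becomes excessive at about that point.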


STIMULUS EQUALITY 


The Problem. In psychological experimentation there are several major
problems connected with the equivalence of stimuli. It is frequently
necessary to produce stimulus conditions that will have the same stimu-
lating effect upon the subject whenever they are presented to him. A
closely related problem is the need to be able to repeat exactly for each
subject the changes introduced in a stimulus during the various conditions
of an experiment. A further problem is to maintain constancy in those
aspects of the stimulus that are not part of the experimental variable
under study.

The difficulties encountered in solving these and related problems of
the stimulus situation vary greatly with differences in the nature of the
stimulus. In the areas of behavior in which there is a fairly close corre-
lation between the characteristics of the physical stimulus and the char-
acteristics of the response elicited, the problems are usually readily
solved. Most of the experiments on sensory thresholds support this
statement. The loudness, pitch, and in some respects the timbre of sounds
are closely associated with physical aspects of the vibrating body. Loud-
ness is associated primarily with the amplitude of vibration, pitch is
associated primarily with the frequency of the vibration, and timbre can
be made to undergo systematic variation by manipulating the intensity
of the several vibratory frequencies composing a complex wave. A similar
correlation is found in vision, with color being associated with wave
length, brightness with the amplitude, and saturation with the relative
amount of the several wave lengths composing the stimulus.
Problems of stimulus equivalence are more difficult to solve when the
subject's response is less closely correlated with the physical character-
istics of the stimulus and is more closely associated with the symbolic
or representative functions of the stimulus. The meaning that any per-
ceptual, memorial, imaginal, or reasoning stimulus item has for a given
subject determines in large part the response he makes to it. The problem
of equating stimuli in these areas is then more difficult because equiva-
lence is no longer a simple function of the physical characteristics of the
stimulus but is a complex function of the psychological characteristics of
the responding subject. To make certain verbal materials equivalent, as,
for example, the words to be used in a learning experiment, the investi-
gator must study the meanings that these verbal materials have for the
subjects on whom they are to be used. To make problems involving
arithmetical processes equivalent as materials for an experiment in mental
work, the investigator must know the arithmetic background of the
subjects.


Another area in which stimulus equivalence is a function of the sub-
ject's characteristics has to do with the complexity of the physiological
response required in executing some particular behavior. For example,
coordination of response and speed of response are not the same function
although they are closely related. A subject may be able to perform a
certain psychomotor coordination with 100 per cent accuracy if he is
allowed to pace himself; whereas if he is forced to work faster than his
normal speed he may perform very badly. The effectiveness of his re-
sponse is not a simple function of his ability to do the complex response,
but is also a function of his ability to vary his rate of work.

The Equivalence of Sensory Stimuli. This topic needs little additional
comment. It was noted in preceding discussions that equivalence in most
instances can be achieved by making the physical characteristics of the
stimulus equal. An explanation of how this is done, as in the field of hear-
ing by means of electrical circuits or in vision by means of prisms or
filters, would lead us too far afield. Suffice it to say that if we understand
the physical nature of a stimulus, and if there is a close relationship
between the physical characteristics and the psychological experience,
a high degree of precision in manipulating and controlling the stimulat-
ing effects can usually be attained.

Stimulus Equality as a Function of Familiarity. Familiarity with the
stimulus characteristic being studied conditions the response that
stimulus characteristic of importan
for testing a theorem does not touch upon experiences common to all
of the subjects, or touches upon the experiences of various subjects in a
differential manner, it will not then serve as an equivalent stimulus. If
variation in response due to differences in familiarity with the stimulus
is not the experimental variable under consideration, then any behavior
change resulting from this factor must be considered as experimental
error.

The factor of familiarity with stimulus characteristics is encountered 
in most experiments in which symbolic materials are utilized. The prob- 
lem of task familiarity arose early in experiments in learning and memory 
in which verbal materials were used. Nonsense syllables were devised as 
a means of creating memory material equally familiar to most persons. 
In this instance, the equivalence of stimuli was achieved by making the 
material equally unfamiliar to most persons. 

Equality as a Function of the Abstractness of the Stimulus. In many 
experiments on the higher mental processes, the purpose is to measure 
the subject's power to behave independently of the nature of the stimulus 
materials he is called upon to manipulate. The more abstract and general 
the stimulus material can be made, the more independent the test be-
comes of the particular nature of the stimulus. A positive relationship
is usually found between the abstractness of the material and the amount
of ability required to manipulate it; that is, the more abstract the stimulus
characteristics, the greater the ability demanded. For example, in prob-
lem solving, the more abstract the characteristics and relationships in-
volved in a problem, the more difficult is its solution.

In this as in other experimental situations, there are the two problems 
of keeping the stimulus at a constant value during a given condition and
producing the stimulus at different amount levels in different experi-
mental conditions. In regard to verbal or symbolic stimulus materials, this
means devising individual stimulus units, such as items, which are equally
abstract in nature for all the subjects. Further, it means that items or
units must be devised that differ in some magnitudinal way in the degree
of abstraction involved. Several levels of abstraction may be required,
with the items within any level being equally abstract in nature.

One method frequently used to achieve equally abstract stimuli is to
reduce greatly the meaningfulness of the material by using nonlanguage
or simple language stimuli. One example in which this was done is an
experiment on concept formation in which Chinese characters were
paired with simple English words. It will be remembered that a concept
is a meaning that has been found common to many otherwise diverse
situations and that has been abstracted from these experiences to the
degree that it can be reacted to independently of them. In the experiment
referred to there were 12 lists of pairs, each containing 12 Chinese-type
characters. Each of the characters in a list was built around a
radical (a particular pattern of lines), and this radical was present in
one of the characters in each of the lists. Thus there were 12 radicals
in a list and each radical appeared once in each of the 12 lists. An English
syllable was assigned to each of the radicals. The task of the subject was
to learn to associate the English syllable with its corresponding radical.
The subject, of course, did not know about the radicals, and the purpose
was to determine if he would learn the common feature or radical that
was associated with a given syllable.

Equivalence of Stimuli and the Problem of Difficulty of Task. Differ-
ences among stimuli often result in differences in difficulty. Difficulty
as here used does not refer to difficulty experienced in sensing the
material (although in some experiments in sensation this may be a prob-
lem) but to difficulty experienced in the learning, or memorizing, or reason-
ing associated with the material. Several procedures are available for
minimizing differences in difficulty among the tasks presented to the 
subject. These will not only effect an increased comparability among the 
stimuli in terms of the factor of familiarity but will accomplish the same 
result in regard to the factor of abstractness. 

One procedure to obtain equality of difficulty is to use the judgments
of experts. This should be considered as only preliminary, to precede a
more exact empirical determination.
The most scientificall




be more difficult than others. It is obvious that combinations using the 
numbers 0, 1, and 2 are less difficult than combinations using other
numbers. It is possible in terms of such information for the experimenter 
to select certain combinations that are likely to be more nearly equal 
than other combinations. Of course, if there are significant doubts con- 
cerning the equivalence of different combinations, the appeal to an 
empirical evaluation of the combinations is indicated. 

Stimulus Equality as a Function of Time of Exposure. It should be 
apparent that when the difficulty of the stimulus, the physiological 
adaptation to the stimulus, or the nature of the relationship between 
unwanted systematic variables and the experimental variable changes
rapidly in time, constancy of the time of exposure becomes an important
objective. For example, in an experiment requiring the reading of a
passage of prose and the answering of questions on the passage, the
length of exposure to the selection must be constant for all subjects. The 
questions become easier to answer, the longer the time given for read- 
ing the passage. 

In learning and memory experiments, several procedures for controlling 
the length of exposure of the material have been devised. One frequently 
used way of accomplishing this is to utilize material that can be easily 
divided into units, such as individual words or other symbols, arrange 
these units in a list, and then present each word by itself through a small 
aperture in front of the subject. The time of the exposure of each unit 
can be kept constant by an electrically controlled shutter device for cov-
ering the aperture periodically.

In experiments on olfactory and skin sensitivity, the exposure time of 
the stimulus is important because the rate of adaptation of the sense 
organs is rather rapid and the recovery following adaptation is often
very slow. When a series of stimuli is to be presented, the adaptation
and recovery phase following one stimulus may interfere with the appre-
ciation of the next following stimulus. This is particularly true in experi-
ments on olfactory sensitivity. It is necessary that the exposure time to
the stimulus be carefully determined to minimize this interference. 

In the glare recovery experiment described in an earlier chapter, the
subject was exposed to a very brilliant light. The rate of recovery is in
part a function of the length of the exposure to the bright light. It will
be remembered that this function was not the one under investigation,
and so variation in exposure had to be removed as a possible systematic
determiner by maintaining a constant exposure time.

Stimulus Equality as a Function of Motor-skill Factors. Sometimes a
subject is required to manipulate the stimulus or part of it. We must
then make sure that the effect of the stimulus on the subject is not
differentially influenced by this manipulatory response. It will be re-
membered that in the psychophysical method of average error the sub-
ject adjusts the comparison stimulus. If this motor adjustment is not
part of the experimental variable, then it should be a response which in
itself does not affect the appreciation of the stimulus; in other words, it
should play a completely neutral role. Very often this is realized if the
response is a very simple one in terms of motor adjustment.
Difficulty in performing a response required in a learning experiment 
can change the effect that the stimulus situation has on the subject. This 
sometimes occurs in experiments involving the learning of the multiple-T 
alley maze by rat subjects. In running this maze, the rat must not only 
learn to traverse the alleys but also to open one-way gates placed be- 
tween the consecutive units to prevent retracing. One type of one-way 
gate is operated by having the rat push himself under the gate, the gate 
falling closed behind him. If the rat gets only his head under the gate 
and decides to withdraw, there is the likelihood of his head being 
momentarily caught under the gate as he backs away. This experience of
getting caught under the gate sometimes is intense enough to condition
the rat negatively to the gate. The gate stimulus now is an object to be
avoided rather than one to be manipulated. Thus the stimulating effect
of the maze is differentially affected because of the rat's unpleasant ex-
perience with the manipulation of the gates.
As described earlier, the subject is often given preliminary practice 
in the responses he must make in manipulating the stimulus. Thus in the 
maze experiment the rat is gradually trained to manipulate the gates. At 
first the gate is left open when the rat goes under it. Then with each
successive trial the gate is lowered by a small amount until eventually the
rat moves the gate up with his head without any difficulty. This train-
ing, of course, is not given in the maze itself but in a single unit located
apart from the maze.

In a later section the procedure of giving preliminary training will be
discussed in reference to controlling the subject's ability and experience.


MINIMIZING THE CONTRIBUTION OF INTRAPROCEDURAL FACTORS

In experimental studies of behavior, determinant factors sometimes 
arise within the experimer 
quence that the results cam
The Problem. In research studies in psychology, the experimental de-
sign requires that a constant routine be followed in applying the pro-
cedures that affect the control and manipulation of the variables. We 
shall be concerned here with physical-stimulus factors. We shall be in- 
terested in characteristics of the physical stimulus that are associated 
with the manipulation and control of the experimental variable and of 
potentially disturbing associated variables. In particular, we shall be 
concerned with experimental procedural factors associated with unwanted 
systematic variables and unsystematic variables. Obviously, these factors 
may function to disturb the systematic functioning of the experimental
variable. Effecting control over potentially disturbing intraprocedural
factors will improve the manipulation of the variables associated with the 
testing of the theorem under study. 

Equal Exposure of the Subject to the Experimental Conditions. In an 
earlier section we discussed the need for keeping constant the exposure 
time of the experimental stimulus. The present section is related to 
this, but covers a somewhat broader area. It is concerned with the equal- 
ity of exposure of the subjects to all phases of the experiment. 

It is difficult to find a satisfactory definition of the phrase “equality of 
exposure.” An acceptable working definition is equality of opportunity 
for the subject to perform in a representative manner. More specifically, 
it means equal opportunity for the subject to be influenced by any experi- 
mental variables that are functioning and equal protection from the influ- 
ence of any unwanted interfering variables. Stated as a question, the 
problem is: Do the experimental procedures afford every subject an
equal opportunity for manifesting his characteristic behavior under the
conditions of the experiment? There are several standard procedures
for solving this problem.

One very frequently used procedure is called the time-limit method.
The logic underlying this procedure is that equal time to work means
equal opportunity to achieve a representative performance. The evalua-
tion of the performance is in terms of the quality and/or quantity of the
product. Quality is measured in terms of accuracy, e.g., the kinds and
numbers of errors. Quantity is measured in terms of the total number of
units of work completed, or the number of acceptable units of work
completed.
In connection with this procedure, a question may be raised concerning
differences between individuals in their pace of work. Certainly, it is
true that some individuals work faster than others. When the experi-
mental design requires each subject to participate in every phase of the
experiment, then presumably the factor of individual pace does not
contribute a differential effect. But when the experimental design re-
quires the utilization of two or more groups of individuals and assigns a
given individual to only one phase of the study, then the investigator
must equate the groups in terms of the factor of pace of work. 
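The text does not say how the groups are to be equated in pace of work. One possibility, offered here only as an assumed illustration, is to rank the subjects on a preliminary pace score and to deal them out to the groups in alternating ("snake") order, so that each group receives fast and slow workers alike. The subject labels, pace scores, and function name below are hypothetical.

```python
from statistics import mean

def equate_groups(pace_by_subject, n_groups=2):
    """Rank subjects by pace and assign them to groups in snake order
    (an assumed matching scheme, not one prescribed by the text)."""
    ranked = sorted(pace_by_subject, key=pace_by_subject.get, reverse=True)
    groups = [[] for _ in range(n_groups)]
    for start in range(0, len(ranked), n_groups):
        block = ranked[start:start + n_groups]
        # Reverse every second block so no group always gets the faster one.
        if (start // n_groups) % 2 == 1:
            block = block[::-1]
        for group, subject in zip(groups, block):
            group.append(subject)
    return groups

# Hypothetical pace scores: units of work completed in a preliminary test.
pace = {"S1": 50, "S2": 48, "S3": 45, "S4": 43,
        "S5": 40, "S6": 38, "S7": 35, "S8": 33}

group_a, group_b = equate_groups(pace)
mean_a = mean(pace[s] for s in group_a)
mean_b = mean(pace[s] for s in group_b)
print(group_a, mean_a)
print(group_b, mean_b)
```

With these invented scores the two groups have the same mean pace. With real data the means will usually be close rather than identical, and the investigator would check them before the experiment is begun.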

A second common procedure is called the amount-limit method. In this 
procedure the subject is presented with a certain amount of work to do, 
and his performance is evaluated in terms of the time required to com- 
plete the task and the nature and number of errors committed. In some 
experiments the task is repeated several times, and performance is evalu- 
ated in terms of the number of trials required to achieve a certain level 
of proficiency, e.g., two successive errorless trials in learning a maze. 

In this procedure the subject usually is allowed time to complete the 
task, and so his pace of work is reflected in his performance. In fact, 
it contributes in a major way to the score that is assigned his performance.
The logic underlying the use of this procedure in the measurement of 
some fundamental attribute of the individual is that 
accurate index of the amount of the attribute that he possesses. When 
different groups of subjects are to be assigned to the different experi- 
mental conditions, it is necessary that the groups be equated in this rate 

eriment is begun. 
easure performance has not always been 
re. Number of trials and time required for 
ved to be unreliable indices in the case of
position of the apparatus to the position of the subject. Removal of these
errors requires a counterbalancing of the spatial relationships among 
stimulus characteristics as these are presented in the experimental con- 
ditions. The space error is encountered in the threshold experiments on 
length of lines using the method of average error. In this method the 
subject is shown a standard stimulus line and is instructed to change the 
length of a comparison line until he judges it to be equal to that of the 
standard. The comparison line can be presented either on the right or on 
the left of the standard. If the comparison line is placed always on one 
side of the standard, a space error is likely to occur. 

Another factor associated with the position of the apparatus is the 
direction of the manipulatory response required of the subject. This 
factor can readily be illustrated in the same psychophysical experiment 
described above. In manipulating the comparison stimulus to produce a 
line approximating that of the standard, the subject can move the line 
from the right to the left or from the left to the right. A movement-direc- 
tion error may result if all adjustments are made in only one direction. 

Control of these two factors, the spatial relation of the apparatus to 
the subject and the direction of the adjustive movement, in order to 
avoid procedural errors, is accomplished by the method of counterbal- 
anced order. In this method the factors are ordered in such a way that 
there is an equating of the right-left position factor and the right-left 
movement factor. The procedure is to compute the various possible com- 
binations of positions and movements and then order them in a random 
arrangement. The combinations of these factors arranged in random order
in the length-of-line experiment are as follows:

Standard line on right and change in comparison line by inward
movement

Standard line on left and change in comparison line by outward
movement

Standard line on left and change in comparison line by inward movement

Standard line on right and change in comparison line by outward
movement
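The four combinations listed above are simply every pairing of the two positions with the two directions of movement, placed in a random order. A brief sketch of that procedure follows. The variable and function names are illustrative, and the seed is used only so that the example is repeatable.

```python
import random
from itertools import product

positions = ["right", "left"]      # side on which the standard line appears
movements = ["inward", "outward"]  # direction of adjusting the comparison line

def counterbalanced_order(seed=None):
    """All position-by-movement combinations in a random order."""
    combinations = list(product(positions, movements))
    rng = random.Random(seed)
    rng.shuffle(combinations)
    return combinations

order = counterbalanced_order(seed=0)
for position, movement in order:
    print(f"Standard line on {position}, comparison changed by "
          f"{movement} movement")
```

Since every combination occurs once in each block of four trials, each position and each direction of movement is used equally often, whatever the random order happens to be.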


Brief mention should be made of the serial arrangement of the units
of material used in memory experiments. This factor is a combination
time-and-location factor. Learning experiments often require the mem-
orizing in serial order of units of material like words or nonsense syl-
lables. Words at the beginning and end of a series are learned more
quickly than those in the middle of the series, demonstrating that the
position within the series is contributing to the rate of learning. In most
learning studies this is not a crucial factor because it presumably oper-
ates equally in all experimental conditions being used. If serial order is
suspected of having a differential effect upon the results, it can be con-
trolled by arranging the units in all possible orders and randomly select-
ing for use as many of these orders as can be appropriately organized into
the experimental conditions.
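The control of serial order described above, listing all possible orders and drawing some of them at random, can be sketched for a short list as follows. The nonsense syllables and the number of orders drawn are hypothetical, and the function name is an assumption.

```python
import random
from itertools import permutations

def sample_serial_orders(units, n_orders, seed=None):
    """List every possible order of the units and draw n_orders of
    them at random, without repetition."""
    all_orders = list(permutations(units))
    rng = random.Random(seed)
    return rng.sample(all_orders, n_orders)

units = ["BAF", "ZUK", "MIV", "TOJ"]  # hypothetical nonsense syllables
orders = sample_serial_orders(units, n_orders=6, seed=1)
for serial_order in orders:
    print(serial_order)
```

Four units give only 24 possible orders, but the number grows as the factorial of the list length, so for lists of ordinary length it would be more practical to shuffle the list repeatedly than to enumerate every order.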
The Contribution of Temporal Factors within an Experi-
mental Sitting. The time that the subject is exposed to the experimental
stimulus has already been discussed. In addition to this, there are time
relations among the stimulus units within an experimental sitting and
time relations among the several experimental conditions, both of which
may present serious problems of control. We shall here be concerned
with the time characteristics of procedures within a given experimental
sitting.
the like, is carried forward from one gradation or condition to another,
it is necessary to effect a balancing of this effect among the experimental 
conditions so it will contribute uniformly to every one. 

In investigating the use of the prone position by the pilot in flying
aircraft, the authors wanted to learn the forces that could be applied by a
person in executing different kinds of movements by the arms and hands
in different variations of the prone position. Three movements of a
wheel-type control column were selected for study, as follows: a push-pull
motion, an up-down steering-wheel motion, and a right-left sideways swing
movement of the whole column. The prone-position bed was varied in
height by using three levels: low, medium, and high. The subject’s body
was positioned at three distances from the control column: close, medium,
and far. The number of experimental conditions that could be
formed by combining the three types of movements with the three bed
heights and the three distances from the control column was 3 X 3 X 3,
or 27. Actually, the number of conditions was greater than 27 because
it was desirable to test the movements from more than the central position
of the control column, e.g., with the control column swung out to
one side of the mid-line of the bed-control column axis.
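The count of 27 can be checked by listing every combination of the three
factors. The following is only a minimal sketch: the level labels are taken
from the text, but the variable and function names are assumptions
introduced for illustration.

```python
from itertools import product

# Factor levels as named in the text; the identifiers are illustrative only.
movements = ["push-pull", "up-down steering-wheel", "right-left swing"]
bed_heights = ["low", "medium", "high"]
distances = ["close", "medium", "far"]


def all_conditions():
    """Return every (movement, bed height, distance) combination."""
    return list(product(movements, bed_heights, distances))


conditions = all_conditions()
print(len(conditions))  # 3 x 3 x 3 combinations
for movement, height, distance in conditions[:3]:
    print(movement, height, distance)
```

Each further position of the control column (for example, swung out to one
side) would multiply this count again, which is why the full set of
conditions exceeded 27.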

It was impossible within the time available to investigate all of the
combinations, and so nine conditions were selected, among which each
of the three characteristics was represented in different values. These
conditions were then arranged in a modified random order, and this order
was varied for different subjects by having them begin each day's work
at a different point in the series. The reason for randomizing and then
counterbalancing the order of the conditions was twofold: first, fatigue
arising within a daily session would be dispersed equally to all of the
experimental conditions; and second, practice or learned effects carried
from one condition to the next or from one sitting to the next would be
evenly dispersed to all of the conditions.
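The idea of one base order entered at a different starting point can be
sketched as follows. This is an illustration under stated assumptions, not
the authors' actual procedure: the text does not say how the "modified"
random order was produced, so a plain shuffle stands in for it, and the
condition labels, function names, and starting points are hypothetical.

```python
import random


def base_random_order(conditions, seed=None):
    """Shuffle the conditions once to obtain a single base order.

    Assumption: an ordinary shuffle replaces the unspecified
    'modified random order' of the original study.
    """
    rng = random.Random(seed)
    order = list(conditions)
    rng.shuffle(order)
    return order


def rotated_order(base_order, start):
    """Begin the series at position `start` and wrap around to the front."""
    k = start % len(base_order)
    return base_order[k:] + base_order[:k]


# Nine hypothetical condition labels standing in for the nine selected ones.
nine = [f"condition {i}" for i in range(1, 10)]
base = base_random_order(nine, seed=1)
for subject, start in enumerate([0, 3, 6], start=1):
    print(subject, rotated_order(base, start))
```

Because every subject works through the same cycle but enters it at a
different point, each condition falls early in the session for some
subjects and late for others, which is how fatigue and practice effects
come to be spread over the conditions.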

The counterbalancing of the temporal order of conditions is especially
important in experiments in memory and in work, in which practice and
fatigue effects may occur within short periods of time. In experiments on
work using expert craftsmen as subjects, it might be thought that we are
dealing with perfected habits, but this is not true. If the experiment
involves the tasks of a factory job, say operating a drill press, and we
use as subjects workers who have been on the job a number of years, we
would presume that there would be no effects from further practice of
the tasks. As a matter of fact, psychological functions seem never to reach
this perfected practice state, regardless of the amount of past experience.
This is true even though the functions are simple in nature and are used
many times daily by the subject, as, for example, simple coordinated
reaching movements of the fingers, hand, and arm. In any experiment on
work it is necessary to take into account the possible effects of practice,
and when the study requires the subject to perform in more than one ex- 
perimental condition, the counterbalancing of the order of the conditions 
is a standard procedure for achieving an equilibration of this practice 
effect. 

Counterbalancing of the Order of Experimental Conditions through 
Their Random Assignment to Subjects. This method needs only to be dis- 
cussed briefly, because it is very similar in purpose and result to counter- 
balancing experimental conditions in time. In fact, it is frequently com- 
bined with the temporal procedure, as was done in the experiment on 
prone-position responses. 

In the present procedure, the order of the experimental conditions is
varied by assigning a different order to each subject or group of
subjects. Suppose the experimental conditions are a, b, and c, and it is
suspected that a practice effect might occur between adjacent conditions.
The possible orders of the three conditions are as follows: a, b, c; b, c, a;
c, a, b; b, a, c; a, c, b; and c, b, a. If there are six subjects, then the
orders are assigned randomly to them.
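The six orders and their random assignment to six subjects can be sketched
as below. The subject labels and the function name are hypothetical, and
the seed is used only to make the illustration repeatable.

```python
import random
from itertools import permutations


def assign_orders(subjects, conditions, seed=None):
    """Randomly give each subject one of the possible condition orders.

    Assumes there are exactly as many subjects as possible orders,
    as in the six-subject example in the text.
    """
    orders = list(permutations(conditions))
    if len(subjects) != len(orders):
        raise ValueError("need one subject per possible order")
    rng = random.Random(seed)
    rng.shuffle(orders)
    return dict(zip(subjects, orders))


subjects = ["S1", "S2", "S3", "S4", "S5", "S6"]  # hypothetical labels
assignment = assign_orders(subjects, ["a", "b", "c"], seed=0)
for subject, order in assignment.items():
    print(subject, order)
```

With more subjects than orders, the same idea could be applied to groups of
subjects, each group receiving one order.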

In this hypothetical example we could have used only the first three
orders of the conditions, on the assumption that practice effect is
determined by position in the series and therefore is equated when each
condition occupies each position in the series. For most experiments this
procedure is sufficient. Occasionally, and especially in experiments on work,
it may be suspected that the effects differ for different temporal
arrangements of adjacent conditions. For example, the learning of task A previous
to the learning of task B might give a different effect than if we have the
subjects learn the tasks in the reverse order. This probably would be the
case when task A is similar but considerably more difficult than task B.
When we suspect variations in effects of this kind, all possible orders of
the experimental conditions are utilized.
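The difference between using the first three orders and using all six can
be made concrete by counting positions and adjacent pairs. This is a
minimal sketch; the helper names are assumptions introduced for the
illustration.

```python
from collections import Counter
from itertools import permutations


def cyclic_orders(conditions):
    """Rotate the series so each condition occupies each position once."""
    n = len(conditions)
    return [tuple(conditions[(s + i) % n] for i in range(n)) for s in range(n)]


def position_counts(orders):
    """Count how often each condition falls at each serial position."""
    return Counter((pos, c) for order in orders for pos, c in enumerate(order))


def adjacent_pairs(orders):
    """Count how often one condition immediately precedes another."""
    return Counter(pair for order in orders for pair in zip(order, order[1:]))


conds = ["a", "b", "c"]
three = cyclic_orders(conds)      # a,b,c; b,c,a; c,a,b
six = list(permutations(conds))   # all possible orders

print(position_counts(three))     # every (position, condition) pair occurs once
print(adjacent_pairs(three))      # a never follows b, b never follows c, ...
print(adjacent_pairs(six))        # every ordered pair occurs equally often
```

The three rotated orders equate serial position, but condition a always
precedes b and never follows it. Only the full set of six orders presents
every adjacent pair in both directions, which is what is needed when the
effect of A-then-B is suspected to differ from that of B-then-A.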

This procedure of counterbalancing the order of the experimental 
conditions through the random assignment of the different orders to the 
subjects is the only method that can be used when all of the work re- 
quired of a subject is to be accomplished in a single session. 
CHAPTER 13


The Subject in Psychological Experiments 


In the preceding chapter we considered problems in experimental
methodology that are connected with the physical testing situation. We
explored the factors in this situation that affect the production and
control of the experimental variables being used to test the theorem and the
factors that are involved in controlling and minimizing sources of
variation that are not a part of the experimental test of the theorem. In the
present chapter we are concerned with problems of experimental
methodology that are associated with the responding subject. Again we
shall explore factors that are manipulated in order to produce the
experimental variables selected for testing a theorem and factors that need to
be controlled and minimized as sources of error variation.


PRODUCING EXPERIMENTAL VARIABLES THROUGH
MANIPULATING PREDISPOSING PSYCHOLOGICAL FACTORS


Behavior cannot be attributed solely to physical stimuli. The nature
and amount of response is, in part, a function of the psychological
predispositions of the subject. In general terms, psychological
predispositions can be defined as tendencies to behave in certain ways because
of the particular manner in which inheritance and past experience have set
the physiological mechanisms to respond. The physical stimulus arouses
to activity mechanisms already conditioned or set to respond in certain


a 
f the resulting behavior change cannot ther 


„sical 
rms of the various attributes of the physic 


stimulus eliciting the response. 


As determiners of beha 
subject can be studied as 
theorems. It is obvious that 
trol in manipulating these 
of the physical stimulus. W. 


amounts we desire. It is possible, however,
to control them as experimental variables in testing theorems, thus
gaining knowledge about hypotheses we are interested in confirming.

Some Predisposing Psychological Factors. There is insufficient space to 
describe all of the many different predisposing psychological factors that 
condition behavior. Four general categories deserve to be mentioned, 
namely, interest, attitude, ability, and past experience. 

Interest. Interests are predispositions that influence us to pay attention 
to certain kinds of environmental situations, such as sports, politics, or 
automobile repairing. Environmental objects become associated with the 
satisfaction of our many different needs. They then take on a high stimu- 
lating value for us, and we acquire an interest in them; we pay attention 
to them and spend much time in seeking them. 

It should be obvious that interests function as determinants of behavior 
because they determine in part the kinds of physical-stimulus situations to 
which we readily respond. Interests also condition the length of time we 
respond, the intensity of our response, and other characteristics. In this
way interests are related to motivation. For example, we tend to continue
making a response if we are interested in it.

Interests are formed in conjunction with all of our needs; therefore
they are legion in number. They furnish an extensive background of
predispositions that can be studied in the role of determinants of
behavior and therefore in the role of experimental variables useful for
testing theorems.

Attitude. This term refers to an enduring predisposition to react either
in a favorable or unfavorable way toward a given type of person, social
group, political issue, and the like. Attitudes are pro-and-con
predispositions. They are closely associated with interests; they may develop
from interests or they may be influential in the development of interests.
Bias and prejudice are two forms of expression of attitude.

Attitudes are important determinant predispositions because they may
condition the behavior we make to any type of stimulus. We can have
attitudes about anything: any kind of behavior, issue, person, group,
object, and so on.

Attitudes vary in what can be called psychological strength; that is,
an attitude may be manifest in terms of behavior that varies in some
intensive way, as illustrated in the vehemence of the expression, the
loudness of the voice, the intensity of the response, the prolongation of the
argument. We usually have little difficulty in observing differences in
intensity of attitude, yet it is a difficult characteristic to describe. We
note that Jones is more outspoken than Smith, that Johnson sticks to his
point regardless of opposition, that Rogers is always appreciative of any
help given him. These characterizations are manifestations of what we
have called the psychological strength of attitude, and although they are
not difficult to perceive, they are very difficult to describe and quantify.

Ability. This is another term that is used to stand for a great many 
specific kinds of predispositions. Ability refers to power to perform a 
response. In general usage it includes potential power as well as present 
facility. By power to perform is merely meant that resident in the indi- 
vidual’s sensori-neuro-muscular equipment is an established or poten- 
tially establishable organization by which a given reaction is or can be 
made. 

We need not take time to describe the many different abilities pos- 
sessed by a person or the various ways in which his abilities determine 
his behavior. Suffice it to say that ability underlies every response an 
individual performs; it makes possible the response and therefore is an 
important determinant of the response. As a determiner of behavior it can
be studied as an experimental variable in testing theorems.

Past Experience. All change that is introduced into the org 
of the individual’s sensori-neuro-muscular equipment be 
ous behavior is to be included in the expre: 
mere act of responding changes the physiolo 
are stable in time and thus they are carried f 
behavior. Everything we do, then, register 
responses, 

By controlling experience it is possible to m 
For example, we can write the statement over as follows: our experiences
are conditioned by our abilities, our abilities are conditioned by our atti- 
tudes, our attitudes are conditioned by our interests. The most complex 
way in which we can imagine these predispositions to be interrelated will 
still probably not equal the degree of complexity actually involved. 
With an enormous number of psychological predispositions interde- 
pendently related, it becomes a difficult task for the scientist to control 
any particular predisposition for experimental study. He is confronted 
with the identical problems that we have repeatedly mentioned; namely, 
he must isolate, manipulate, and measure certain predispositions as ex- 
perimental variables while at the same time keeping constant the effect 
of other predispositions functioning as unwanted systematic or as
unsystematic variables. With less direct control over these variables than is
possible in the manipulation of the physical-stimulus situation, he must
depend more upon the methods of selection and statistical analysis for
effecting the relationships necessary for his comparisons. These methods,
when appropriately and adequately applied, achieve the precision of
control required for experimentally testing theorems.

The Use of Predispositions as Experimental Variables. In the last
chapter we examined procedures by which change in behavior can be elicited
by manipulating the external stimulating situation. Variation in the stim- 
ulus can then be correlated with the resulting variation in the response. 
From the interrelationships discovered between the stimulus and response 
changes it is possible to make inferences about the psychological processes 
of the responding subject. The responses of the subject are considered to 
reflect directly fundamental psychological predispositions as these are 
brought into activity by the stimulus. The primary purpose is to study a 
given predisposition as a psychological process in order to learn its na- 
ture; that is, what it is as a functioning psychological process in the sub- 
ject, what its constituents are, what its precursors are, what variation is 
possible in its amount or frequency of occurrence. The responses
reflecting the predisposition are the center of attention.

In the present chapter we are interested in predispositions as
experimental variables; that is, we are interested in determining how variation
in a given predisposition affects some other psychological process. This
is an active psychological process, represented objectively in some
pattern of response, which we examine in order to determine if it changes
when we introduce variation in some predisposition that we wish to study.
Of course, this psychological process itself is a predisposition reflected in
the subject’s behavior, but it is not the predisposition in which
experimentally controlled variation is introduced. Rather, it is the behavior that
is expected to register a change when purposeful variations are made
in the experimentally controlled predisposition. We are interested in
measuring the nature of any change in the active psychological process
in terms of amount, intensity, or frequency as we introduce change in
kind, amount, or frequency in the experimental predisposing factor. An
example would be determining the relationship between various amounts
of religious training and incidence of juvenile delinquency. The
religious-training variable is the experimental predisposition; juvenile delinquency
is the active psychological process.

The Isolation of Predisposing Psychological Factors. Isolating the
Predisposing Factor as Part of a Complex. It was indicated earlier that
psychological predispositions are complexly interrelated and that it is
very difficult to isolate a particular predisposition for experimental study.
In most instances, the best we can do is to make the predisposition to
be examined the predominant one within the complex of variables that
we find reflected in the behavior. In other words, production of the
experimental predispositional variable in pure form is usually impossible,
so we must endeavor to get it to predominate in a complex that we isolate.
It must be the most important determinant in the complex so we can be
assured that, for the most part, any results obtained can be attributed
to it.


ate a predisposing psycho- 
dices of its existence. We 
tests, inventories, or ratings.

Several steps are involved in the process of isolating a psychological
predisposition as an experimental variable. First, it is important to
define as accurately as possible the specific psychological predisposition
that is to be studied. It is through careful definition that the
interrelationships of the predisposition with others are revealed. Often it
is possible by careful definition to discover signs that minimize any
unwanted interrelationships.

Secondly, signs must be found for detecting the predisposition as
defined. Not just any past occurrence that was a manifestation of the
predisposition can be used. The more closely the signs fit the definition,
the greater the chances of freeing the experimental predisposition
from other potentially interfering predispositions.
For example, if the predisposition is revealed in the amount of schooling
in the first 8 years of life, there are many persons available for study in
whom this sign can be found. If the predisposition is revealed in such a
variable as religious bigotry, it might be very difficult to find subjects in
whom the accepted signs can be found.

The Measurement of Predisposing Psychological Factors. The Problem.
To obtain the most accurate description of the predisposing factor
we must find ways for quantifying it. Most predispositions can be con- 
sidered to be magnitudes and to vary in terms of amount. The problem 
is to find signs that are quantitative in nature and that at the same time 
accurately reveal the predisposition. For many predispositions, it is not a 
difficult task to divide them into the two quantitative categories of pres- 
ence and absence and use one group of subjects that possesses the predis- 
position and another group that does not possess it. It is sometimes a 
difficult task, however, to devise accurate descriptions of different levels 
of amount by which we can accurately quantify any functional
relationship that is brought under study.
The Use of Descriptive Indices. Many descriptive indices vary in 
amount, thus making a quantification of the predisposition possible. 
Furthermore, with many of these indices it is safe to assume that the 
quantitative variation in the index is revealing of quantitative variation 
in the predisposition. There are exceptions, however, and the scientist 
must be on the alert to detect instances in which the quantitative change 
in the sign does not correspond with the quantitative change in the pre- 
disposing factor. For example, the length of a course studied in school 
does not necessarily accurately indicate the amount of experience gained 
and retained by a student. An individual may take a course with little
intent to learn, and so what little he does learn is not accurately reflected
in the amount of time spent in training. However, the duration of exposure
to some types of experience is often a sound index of quantitative
variation in predispositions. Some examples are the time
spent on a certain job, the number of years of
When the sign of the predisposition is being quantified, it should be
apparent that the units of measurement may be crude and approximate,
and there may be times when the sign is not even applicable to a
particular individual. These difficulties, however, do not make measurement
impossible.

The Use of Measuring Devices. Measuring devices that give a
quantitative description of present ability can be used to assess predispositions.
The effectiveness of present performance can be assumed to reflect the
amount of the predisposition. If one


are corresponding quanti- 
e two persons. Inaccuracy 
performance measure and 
e can be demonstrated to 


anipulating 
problem of 
factors, it should be apparent that knowledge about the experimental
subjects is of primary concern. Having defined the predisposition and 
determined the nature of the signs that will reflect it, the next important 
task is to find persons in whom the signs can be clearly discovered in 
the forms and amounts required. Knowledge of persons is the avenue 
through which we eventually gain control of the predisposing factor. 
Failure to get comprehensive and accurate knowledge about potential 
subjects relative to the signs of the predisposition will inevitably
introduce error. Such error is difficult to appraise and therefore difficult to
remove because the nature and amount of its effect on the experimental
variable cannot be determined. It is a far sounder procedure to avoid
such error by accurately assessing the potential subjects relative to the
signs being used to reflect the predisposition.

Subjects used in the experiment must be selected carefully. The cri- 
teria of selection are usually readily verbalized but are frequently diffi- 
cult to apply. Subjects should be selected for study in whom the signs 
reflecting the predisposition are accurately discernible and in whom the 
signs meet the characteristics of kind, amount, frequency, etc., prescribed 
in the experimental design. Carelessness in the selection of the subjects 
spells failure to manipulate the predisposing variable in accordance with 
the experimental design. 

An Example. Consideration of an example will help to highlight some 
of the points made in the preceding sections. Suppose there is an indus- 
trial training program to be set up for the job of machinist. The follow- 
ing are relevant facts: (1) there is an appreciable number of new em- 
ployees to be trained each year; (2) the employees range in background 
from novice to expert, but the largest number have had considerable 
machine-work experience although usually not of the special kind re- 
quired by the job; (3) there are three different training methods that 
can be utilized; (4) these training methods vary considerably in cost 
and in the time away from the job required of the worker during
training; (5) it is believed that the effectiveness of the several methods is a
function of the previous training and experience of the trainees; (6) it
is believed that cost of training might be kept at a minimum if certain
particular kinds and amounts of previous training and experience were
required of the prospective trainee.

The predisposing variable is the previous training and experience of
the prospective trainees who will be selected for training from among
applicant workers. Evaluating variation in this predisposition requires a
consideration of both formal and informal training, experience gained on
jobs, achievements in terms of characteristic rates of work or quality
of work, ability and experience in repairing machines as well as in
operating them, and other relevant signs. All such characteristics should be
examined in the process of defining the predisposing factor and in
discovering means for quantifying it.

The variable in which we look for change when we introduce different
values in the experimental predisposition is the training-methods
variable. What we are interested in determining is the effect that differences
in the amount of previous training and experience have on the relative
effectiveness of the several training methods. If it is found that differences
in the predisposing factor affect one training method significantly more
than another, this will furnish the information needed. This information,
together with information on the relative effectiveness and relative costs
of the methods, will enable management to decide which training method
to adopt and what training and experience to require of the worker who
is recruited for the training program.

PRODUCING EXPERIMENTAL VARIABLES THROUGH 
MANIPULATING ANATOMICAL-PHYSIOLOGICAL FACTORS 


All behavior, in whatever form it is observed, is dependent upon tissue
changes in the body. Knowledge about behavior will then be increased
through a study of the nature of the tissue changes that underlie it. In
all previous discussions in this text, whether it has been made explicit or
not, it has been assumed that between what we commonly call the
external physical stimulus and the resulting change in behavior as observed
either in experience or in overt movements, there is activity in the
complex mechanisms of the sense organs, the nervous system, the glands,
and the muscles. Behavior is the functioning of these mechanisms.

The General Problem. Studying the correlation of changes in the sense 
organs and the nervous system with changes in behavior is one of the 
oldest research areas in both physiology and psychology. In recent years
there has been an accelerated interest in the area as a result of the great
strides made in the improvement of experimental procedures.
Developments in the field of electronics have been adapted to the measurement
of the micropotential changes found in sense organs, nerves, and muscles.
Quantitative techniques for measuring molar types of behavior have
significantly added to the tools required for studying functional
relationships between integrative behavior and the underlying changes in
the response mechanisms.


In most studies involving the anatomical-physiological mechanisms of
response, the hypotheses revolve around two problems, namely, the
problem of the “where,” and the problem of the “how.” The first concerns the
localization of the change underlying the response. A typical problem
concerning the nervous system would be the determination of the
particular part of the brain of a rat that is required for the learning of the
maze. The problem of the how concerns the nature of the activity
underlying the response. For example, we could ask: What particular
electrical and chemical changes take place in the cortical fields that are active
during a given response? To date, the attention of the psychologist has
been primarily directed to the problem of the locus of the tissue change
underlying behavior.

We are here concerned with reviewing some of the procedures by
which the anatomist, physiologist, and psychologist have endeavored to
find explanations of behavior through the study of response mechanisms.
We must restrict our discussion to those procedures directly concerned
with producing change in behavior by manipulation of these
mechanisms. There are important subsidiary procedures, such as those of
histology, anesthesiology, encephalography, etc., by which greater precision
is gained in the description and quantification of the behavior changes.
It is not possible for us to give space to the description of all of these
procedures.

Some Procedures Used in the Study of the Nervous System. We can
profitably begin our study of experimental procedures for manipulating
response mechanisms by a consideration of those that are used in the
investigation of the nervous system. Many of the procedures that are
used in studying the nervous system are also applicable to nonneural
response mechanisms.

The Procedure of the Nerve-muscle Preparation. The neuron is the
smallest structural unit of the nervous system, and to study its functions
is to study the mechanics of behavior in their most rudimentary phases.
The activity of neurons can be studied in the nerve-muscle
preparation. This preparation consists of a given motor nerve with its attached
muscle fibers. Sometimes the preparation is removed from the body; at
other times the experiment is conducted with the nerve and muscle in
situ. The application of electrodes on the nerve permits control of the
duration and intensity of the electrical charge used as a stimulus. Other
electrodes placed on the nerve or on the muscle enable an accurate record
to be taken of the nerve-fiber response. Several hypotheses concerning
the electropotential characteristics of nervous action have been
investigated, using the nerve-muscle preparation.

The Procedure of the Spinal Preparation. At the reflexive level, we
study the mechanisms involved in simple muscular movements. The
simplest form of reflex response is observable in spinal animals. In
this preparation the spinal cord is transected in the cervical region, thus
removing the nervous centers below the cut from the influence of
the higher brain centers. The neural centers below the level of
transection remain functional and can be stimulated to activity through
the afferent nerves serving them. The muscles below the transection also
remain functional and serve as a means by which the changes in the
neural centers can be studied. It should be remembered that we cannot
directly observe the central nervous changes underlying responses. We
must infer the nature of these changes from the characteristics of the
stimulus used to elicit them and from the nature of the resulting responses.

Three very significant hypotheses for explaining simple coordination of
response, which have been confirmed by many experiments using spinal
preparations, are the reciprocal innervation of antagonistic muscles, the
central excitatory state of the motor neuron, and the central inhibitory
state of the motor neuron. It would take us too far afield to describe
these hypotheses.


The Procedure Involving Surgical Lesions in the Higher Integrative
Centers. In this procedure the nervous mechanism to be studied is rendered
partially or totally inoperative. This change in the functional condition
of the mechanism is the experimental variable. The effect of the
change is then noted in some form of behavior, such as sensory experience,
motor coordination, emotional expression, or activity required to
learn or solve a problem. The surgery is done under aseptic conditions,
with the animal under deep anesthesia. The same attention is given to the
animal as if the operation were being performed on a human being in a
well-equipped hospital.

In studying the nervous mechanisms involved in the higher mental
processes, lesions are produced in cortical and subcortical regions of the
brain. The tissue can be destroyed by cutting, but it is more frequently
destroyed by thermocoagulation, using an electric current. Investigations
of the functional importance of localized 
cortex have required proced 


very restricted areas or regions. One procedure devised for this purpose 


ntrolling the movement of the elec- 


s in the three planes 
electrode in the sub- 
x in reference to these 


and positioning a point 
d electrodes on the corte 


of the lesion, the investigator cannot achieve comparable lesions in dif- 
ferent animal subjects, 


maps and quantitatively describes the regions destroyed. Only by
meticulously following each of these steps is it possible to obtain reliable facts
from which accurate statements can be made concerning the functional 
importance of brain regions. 

Some hypotheses that have been investigated by this procedure are: 
mass action (that the effectiveness of behavior is a function of the 
amount of cortical tissue being activated); equipotentiality of brain tis- 
sue (that the cortex is undifferentiated in so far as higher mental activi- 
ties are concerned; i.e., that one part of the cortex is as important as any 
other part); cerebral dominance (that in a given response the cortical 
centers are prepotent over the subcortical centers). 

The Procedure of Stimulation. This is one of the oldest procedures 
for experimentally studying brain functions. The brain is exposed under 
aseptic conditions and electrodes are gently applied to the region to be 
stimulated. Response in the muscles is used as a means of discovering and 
describing any discharges in the motor nerves produced by the stimulus. 
Occasionally with human subjects (persons who are undergoing brain
surgery and consent to having their brain cortex electrically stimulated)
it is possible to record changes in subjective feelings, particularly changes
in sensory experience.

Stimulus experiments have been conducted on deeper nuclei of the
brain by inserting needle electrodes into these deeper tissues. Peripheral
nerves have also been studied in vivo by the method of electrical
stimulation.

The Electrical-potential-recording Procedure. The development of
electronics has made it possible to record accurately electrical-potential
changes from individual nerves, from the deeper nuclei of the brain, and
from the surface of the cortex of the brain through the skull. The eel
has been a favorite animal in the study of the functions of the optic nerve,
and the cat a favorite subject in the study of auditory-nerve potentials.
Naturalistic stimuli of light and sound are used. The electrical discharges
along the nerve and from surrounding tissues are registered and
photographed. In some experiments, the experimental variable consists of
changes in one or more characteristics of the external stimulus. In others,
the experimental variable consists of changes produced in the tissue by
surgery or drugs.

The electrical-potential changes recorded from the cortex through the
skull are called brain waves. In humans these waves are studied in
connection with the subject's mental activity, emotional condition, or
purposely introduced changes in sensory stimulation. In the operated animal
the introduction of stimuli in various parts of the nervous system makes
possible the study of facilitation, inhibition, and blocking.

Changes in electrical potential of various rates have been recorded.
When the human subject is in the resting state the rate of change in the
brain waves is about 10 per second. This is called the alpha rhythm. 
Other rhythms, both slower and faster than the alpha rhythm, have been 
found. 
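
To make the idea of a rhythm expressed as a rate per second concrete, the following is a minimal sketch of how such a rate might be estimated from a sampled record. It is an illustration only: the record is a synthetic sine wave, not a brain-wave recording, and the function name, the sampling rate of 200 per second, and the 2-second duration are all assumptions made for the example.

```python
import math


def upward_crossing_rate(samples, sampling_rate_hz):
    """Estimate cycles per second from upward zero crossings.

    The estimate uses the span between the first and last upward crossing,
    so it does not depend on where the record happens to start or stop.
    """
    crossings = [
        i for i in range(1, len(samples))
        if samples[i - 1] < 0 <= samples[i]
    ]
    if len(crossings) < 2:
        raise ValueError("need at least two upward crossings")
    elapsed_seconds = (crossings[-1] - crossings[0]) / sampling_rate_hz
    return (len(crossings) - 1) / elapsed_seconds


# Synthetic stand-in for a resting record (hypothetical values): a wave
# of 10 cycles per second, sampled 200 times per second for 2 seconds.
SAMPLING_RATE_HZ = 200
record = [
    math.sin(2 * math.pi * 10 * (i / SAMPLING_RATE_HZ) + 0.1)
    for i in range(2 * SAMPLING_RATE_HZ)
]
estimated_rate = upward_crossing_rate(record, SAMPLING_RATE_HZ)
```

A real record contains several rhythms mixed together with irregular activity, so counting crossings in this way would serve only as a rough first description.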

Some Procedures Used in the Study of Sense Organs. It is not possible, 
in the limited space available, to describe the many procedures that have 
been evolved for studying the functions of sense organs. As stated earlier, 
methods involving electronic devices have greatly improved the recording
of sense-organ responses. Changes in sensory nerves, as electronically
registered, are used to reveal what occurs in the sense organ itself. Much
knowledge about the eye, the ear, touch endings, and kinesthetic endings
has been discovered through this procedure.

Present discussion is limited to a consideration of the cutaneous senses.
In the study of these endings, several procedures have been evolved
that we have not yet described.

The Problem. In early studies on the nature of the sensory responses 
of the skin, a question arose concerning the existence of specialized kinds 
of end organs for mediating the different experiences of pressure, warmth, 
cold, and pain. Opposed to this theory is the explanation that each dif- 
ferent kind of stimulus sets up a distinct pattern of excitement in the 
sensory nerves which is then interpreted by the brain. It would appear 
that there are facts to support each point of view, which leads to the
conclusion that probably both discreteness of anatomical tissue and dis- 
tinctness of neural reaction underlie the responses of the skin sense 
organs. 


Identifying Cutaneous Endings by Correlating Psychological and Anatomical
Findings. Two facts underlie this procedure. First, there is the
punctate distribution of sensitivity in the skin. By this is meant that the
skin is not uniformly sensitive to different stimuli. If a given area of the
skin is explored, it will be found that some spots or points are sensitive
to warmth, others to cold, others to pressure, and still others to pain.
Secondly, the anatomist has been able to identify several different kinds
of encapsulated endings in the skin that could serve as sensory end organs.
In addition, the skin is richly served with free nerve endings that could
also function as sensory end organs. These various endings, particularly
the encapsulated ones, are not uniformly distributed throughout all skin
areas. The method of correlation endeavors to determine for a given skin
area the relation of the location and density of the endings to the
location and density of the pointlike experiences. This approach is not
entirely satisfactory, as the correlation is not usually based on findings
from the same subjects; that is, the anatomical results discovered on one
person are associated with the psychological reactions discovered on
another person.
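
The relation between the density of endings and the density of sensitive points can be expressed as a coefficient of correlation. The sketch below assumes, purely for illustration, that both counts are available for the same five skin areas. The counts are hypothetical, and the function name is ours.

```python
import math


def pearson_r(xs, ys):
    """Product-moment correlation between two equally long lists."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two lists of equal length, at least 2 items")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    covariation = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    variation_x = sum((x - mean_x) ** 2 for x in xs)
    variation_y = sum((y - mean_y) ** 2 for y in ys)
    if variation_x == 0 or variation_y == 0:
        raise ValueError("correlation is undefined when a list is constant")
    return covariation / math.sqrt(variation_x * variation_y)


# Hypothetical counts per square centimeter for five skin areas.
cold_spots_per_cm2 = [3, 7, 12, 6, 9]
encapsulated_endings_per_cm2 = [2, 5, 9, 6, 7]

r = pearson_r(cold_spots_per_cm2, encapsulated_endings_per_cm2)
```

A high coefficient obtained in this way shows only that the two densities vary together from area to area. It does not establish that a given ending mediates a given experience, and it is weakened further when the two sets of counts come from different persons.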
The Surgical Procedure of Identifying Cutaneous Endings. In this procedure
a careful study is made of the experiences arising from a given
skin area, all points of experience being carefully mapped. Then the 
skin is excised, stained, and examined under a microscope. The points of 
sensation marked on the skin are then examined in connection with any 
sensory endings found nearby. It has not been possible by this method 
to establish a one-to-one correlation between the different types of 
sensory experience and the different kinds of sensory ending. 

The Vital-staining Procedure of Identifying Cutaneous Endings.
Sensory endings can be identified and accurately delimited by
vital-staining procedures. Although the end organs are desensitized by
injecting the dye under the skin, sensation returns after a few hours, so that
experiments can be conducted while the endings are still identifiable.
Some correlation between type of sensory experience and type of end
organ has been found by this method, but the facts do not support a
one-to-one relationship.

Procedures Using Loss and Recovery of Sensation. In these procedures
the sensory endings are made nonfunctional and then allowed to recover.
It is argued that if the sensory experiences are lost and recovered
in a precise order, this would be evidence that there is a distinct
anatomical structure for each experience. Loss of sensory experience may be
produced by sectioning the peripheral nerve, by cocaine anesthesia, by
freezing, or by asphyxia brought on by cutting off the blood supply long
enough to cause oxygen want. Before the loss in sensation is produced,
the skin area to be affected is carefully mapped in regard to the different
cutaneous experiences. In all procedures except the one involving the
sectioning of the peripheral nerve, the rate of loss of the several sensory
experiences is carefully determined. In every method the time and rate
of recovery of the different sensory experiences is recorded. Results from
the different methods are not in complete agreement. For example, the
order of loss under cocaine anesthesia is first cold, then warmth, and
last pressure, with the time of disappearance of pain varying in different
experiments. In the method of asphyxia, the order of loss is first light
pressure, then cold, then dull pressure, then warmth, and finally pain.

The Manipulation of Anatomical-physiological Mechanisms by the
Use of Drugs. Some Uses of Drugs. We have already alluded to the use
of drugs in the study of cutaneous sense organs. Drugs have also been
used widely to desensitize, block, excite, and otherwise affect different
response mechanisms. In experiments with animals, curare has been an
invaluable aid in work with muscles. For example, it has been used in
paralyzing somatic muscles in experiments on conditioning. It has also
been used as a cortical depressant. The effects of Benzedrine have been
studied in connection with the higher mental functions. The effects of
caffeine, alcohol, and the barbiturates have been studied on a wide
variety of responses. Glandular products have been used in investigations
of certain kinds of emotional reactions.

Controlling the Factor of Suggestion. One of the most stubborn prob- 
lems that has confronted the experimenter in the use of drugs on human 
subjects is the control of the stimulating or inhibiting effect that is sug- 
gested by the use of the drug. The subject, being aware, at least in part, 
of the nature of an experiment, may react to the administration of the 
drug in terms of what he believes the effect should be. This is to say 
that when the drug is given, he may d 
the direction that is in agreement with 
effect. For example, if an individu 
does not have a deleterious effect o 
a task such as mental arithmetic he will work extra h 
in order to demonstrate that he is not affected by the drug. 


tion of drugs. Caffeine is administere: 
in coffee, and a placebo, looking exa 


MINIMIZING THE CONTRIBU 
PSYCHOLOGICAL FACTORS
mental design to plan procedures by which unwanted factors will be
prevented from exercising a determinant effect upon the measures se- 
lected for evaluating the experimental variable. 

For purposes of description, some of the major psychological factors 
needing to be controlled and procedures used in dealing with them are 
considered under three headings; namely, motivation of the subject, 
facility of the subject in adapting to the experimental conditions, and 
predisposing background factors of the subject. 

The Control of Motivational Factors. Special procedures for maintaining
high motivation in the subject are important for obtaining a representative
performance. The subject's response to the physical stimulating
situation is in part determined by his desire, willingness, or even
compulsion to react.


Animal Subjects. In animal experiments, hunger, thirst, and pain are
the principal drives that are manipulated to motivate the subject to
respond. The degree of the subject's hunger can be crudely controlled by
regulating the amount of food. Thirst is manipulated by regulating the
amount of water. In either case, within limits, the longer the incentive
is withdrawn, the more likely is the animal to respond to the experimental
conditions presented to him. The usual effect of electric-shock
stimuli is to increase the motivation of the animal to respond, but control
of the level of motivation is much more difficult than with food or
water. There are wide differences in susceptibility to shock stimuli among
different subjects (both human and animal). Also, within a given
experimental period the sensitivity to shock of the same subject may change
markedly.

Human Subjects. The problem of getting representative performances
in human subjects is also closely associated with their motivation. Humans
usually volunteer to be subjects and are not under compulsion to
cooperate. This does not mean, however, that they are equally motivated
to respond under the experimental conditions set for them. Subjects
approach the task given them with different motives, sets, and
intents. Furthermore, differential change in their motivation occurs as
the experiment proceeds, some showing more cooperation, some showing
less.

The matter of the individual's aspirations and prestige is involved
in nearly all experiments involving the higher mental processes.
Individuals differ greatly in their aspirations. Some set high aspirations for
most tasks they undertake; others do not. Some put their prestige at stake
in nearly every task they perform; others do not. In most cases the subject
wants to know about the quality of his performance and how it compares
with that of other subjects. Some individuals develop anxiety
when they are not satisfied with their performance.

The experimenter’s control over the motivation of human subjects is
usually by way of instructions, although he can also utilize some forms 
of reward. Children can be highly motivated if promised desired objects 
like candy. Adults can often be highly motivated by being promised a 
complete description and explanation of the results following completion 
of the experiment. 

Special sets and intents are controlled by standardized instructions.
The use of such instructions guarantees that everyone will be given the
same explanation, and thus no subjects will have an advantage over
others because they received extra aid. The instructions should be
comprehensive, covering all areas wherein the subject must respond. They
should be written at a level that comes within the understanding of the
particular subjects to be used. They should be given a tryout in a pilot
study to determine if they actually function in a satisfactory way.

Provision should be made for giving special instructions when the
subject’s interest flags. The experimenter should prepare ahead of time and
put into standardized form the type and amount of extra stimulation
he believes it possible to give a subject whose interest is flagging. Leaving
such special instruction to the exigencies of the moment when they
are urgently needed runs the risk of changing the experimental
conditions and invalidating the testing of the theorem.

The Facility of the Subject to Adapt to the Experimental Conditions.
The Problem. In most psychological experiments there will be features
of the procedures, materials, or apparatuses that will be totally unfamiliar
to the subject, and this unfamiliarity should not function as a determinant
variable. Many subjects will not have had experience with specific tasks
comparable to those they will be called upon to perform. For example,
in an experiment on concept formation the subject will bring to the
laboratory his past experience with concepts, abstractions, and the like,
but in addition he may be exposed to the relatively unfamiliar stimuli of
Chinese characters, printed marks called radicals, memory exposure
devices, and other content features peculiar to the experiment. We must
be aware of the fact that different subjects will not adjust to these un- 
15 3 with the same ease and readiness and that because of 
pt andersins ontana ae Dene th ire an 
then function ina eng sera subjects, These vorrig maay 
factors may change duria : arly, attitudinal and —_ 
ie paddies ws a n ways not associated with 

Special steps must then be taken to control such
unwanted variations as may arise within the particular investigation being
conducted.

The Procedure of Preliminary Practice. If the design of an experiment
requires the subjects to be equally conversant and facile with the
particular materials and procedures to be used, then the subjects are given
preliminary practice on these or similar materials and procedures before
being actually started on the experiment. The purpose of the preliminary
training is to bring subjects to a comparable level of familiarity before
the experiment.

Suppose that the experiment demands that a subject operate two control
wheels similar to those found in a machine lathe. For example, the
two-hand motor coordination test requires the subject to manipulate two
control wheels, one with each hand. The task confronting the subject is to
keep a movable pointer, which is controlled by these wheels, in contact
with a target that is made to move automatically at varying rates through
a prescribed irregular course. Left and right movement of the pointer is
controlled by the left-hand wheel and forward and backward movement
of the pointer by the right-hand wheel. It is very difficult to learn how to
operate this test merely from written or oral instructions. A minute's
practice manipulating the controls, however, is usually sufficient to
develop an understanding in the subject of what he is expected to do.

Very few experiments are so simple in design that all subjects are
ready to perform the required tasks without practice. Nearly always,
then, a fore-exercise, practice exercise, or preliminary training is given
before the subject is required to respond to the experimental conditions
in which the theorem is under test.

The Procedure of Completed Practice. The purpose of this procedure is
to bring the subject to a level of performance where no further
improvement is expected from additional practice. If the experimental stimulus
is likely to have only a slight effect on behavior, great difficulty may be
encountered in detecting this effect and in separating change resulting
from the experimental variable from change resulting from other
determinants. Any particular response to be used as a test of the experimental
variable should then be well established in order to minimize changes in
it from irrelevant determinants. For example, if the rate of the
color-naming response is to be used to study the possible deteriorating effect of
the ingestion of alcohol, it is necessary to remove other determinants of
the rate of color naming, such as practice effect, fatigue, boredom,
attitude of subject toward use of alcohol, etc. One of these, namely
practice effect, is controlled by the method of completed practice.

In the case of complex tasks it is doubtful if a subject can be trained
to a maximum of his performance within the limited time at the disposal
of both experimenter and subject. In conducting an experiment on the
effects of high temperatures on mental performance, one of the authors
found it very difficult to get the subjects trained to a stable
level. The task was the addition of two 1-place numbers, which actually
is very simple in nature as mental tasks go. The individual problems were
arranged 15 to the row and 25 rows to the page, and several pages were
prepared with different combinations and sequences of problems in order
to prevent memorizing of sequences of answers. The subjects were
practiced 20 minutes a day.
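
Whether a subject has reached a stable practice level has to be decided by some explicit rule. The following is a minimal sketch of one possible rule, not a procedure taken from the experiment just described. It compares the mean of the most recent practice sessions with the mean of the sessions just before them. The window of 5 sessions, the tolerance of 2 per cent, the function name, and the scores are all assumptions made for illustration.

```python
def has_reached_stable_level(daily_scores, window=5, tolerance=0.02):
    """Return True when the mean of the last `window` sessions differs from
    the mean of the preceding `window` sessions by no more than `tolerance`,
    expressed as a fraction of the earlier mean."""
    if len(daily_scores) < 2 * window:
        return False  # too few sessions to compare two full windows
    earlier = daily_scores[-2 * window:-window]
    recent = daily_scores[-window:]
    earlier_mean = sum(earlier) / window
    recent_mean = sum(recent) / window
    if earlier_mean <= 0:
        raise ValueError("scores must be positive for a relative comparison")
    return abs(recent_mean - earlier_mean) / earlier_mean <= tolerance


# Hypothetical numbers of problems completed in each 20-minute session.
still_improving = [120, 126, 131, 137, 142, 148, 153, 159, 164, 170]
leveled_off = [150, 168, 175, 179, 180, 181, 180, 182, 181, 180,
               181, 182, 180, 181, 181]

improving_is_stable = has_reached_stable_level(still_improving)
leveled_is_stable = has_reached_stable_level(leveled_off)
```

A rule of this kind is only a working criterion. A subject may hold one level for some time and then surpass it, so the check should be repeated as practice continues rather than applied once.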

In the case of one subject, six distinct practice levels were attained and
surpassed. After practice was extended through about 3 months, the
subject was still showing improvement. Needless to say, the subject was
not used in the experiment. But suppose he had been exposed to a
high-temperature environment while working on the addition problems, and
suppose that there was no difference in his performance between the
high-temperature and normal-temperature conditions; it could not be
concluded that the high temperature had had no effect because it could
have been possible that a deleterious effect from the high temperature
was canceled by further improvement in the function of addition.
Inasmuch as any effect from high temperature was expected to be rather small,
it was extremely important that a stable practice level be reached by the
subject.

Minimizing Variation
Problem. In every experiment there wil

If the attitude of subjects has a direct bearing on the nature of the
investigation (as, for example, stereotyped racial attitudes in a study
involving judgments about the behavior of different racial groups), the
experimenter can often manipulate this attitude to equalize its
contribution among the several conditions of his investigation by a random
selection and assignment of the subjects to the different experimental
conditions. In the discussion of the use of alcohol, the attitudinal factor of
suggestion was mentioned. Left uncontrolled this suggestion factor
probably would contribute as an unwanted systematic attitudinal variable in
most experiments in which human subjects were used. Control of this
factor by means of random selection and assignment of subjects to the
experimental conditions does not remove the effects of suggestion but
distributes them randomly to the several conditions.
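
Random assignment of subjects to conditions can be carried out mechanically. The sketch below shuffles a list of subjects and deals them out to the conditions in turn, so that the groups are as nearly equal in size as possible. The subject labels, the condition names, and the function name are hypothetical, and the fixed seed is used only so that the illustration can be repeated.

```python
import random


def assign_at_random(subjects, conditions, seed=None):
    """Shuffle the subjects, then deal them out to the conditions in turn."""
    rng = random.Random(seed)
    shuffled = list(subjects)  # copy, so the caller's list is left unchanged
    rng.shuffle(shuffled)
    groups = {condition: [] for condition in conditions}
    for position, subject in enumerate(shuffled):
        groups[conditions[position % len(conditions)]].append(subject)
    return groups


# Twelve hypothetical subjects, labeled S01 through S12.
subjects = [f"S{number:02d}" for number in range(1, 13)]
groups = assign_at_random(subjects, ["experimental", "control"], seed=1)
```

As the text states, such an assignment does not remove the attitudinal factor. It only leaves to chance which condition receives a given subject, so that the factor has no systematic connection with any one condition.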

The Control of Interest, Ability, and Past Experience. In contrast with
attitude, factors conditioning interest, ability, and past experience are not
so readily held constant, and frequently special precautionary procedures
must be taken to control their possible determining influence. For many
investigations dealing with the nature of psychological variables, it is
sufficient to follow the rule of selecting subjects who have the minimum
of interest, ability, and past experience required by the tasks of the
experiment. For example, the rule of minimal facility is usually used in
studying the characteristics of sensory experience and the sensory
thresholds, the basic facts of learning, the laws of memory, and similar
topics. The interest, ability, and experience of the subject are then
supposed to contribute to the understanding of the procedures to be followed
by him; but they are not supposed to contribute differentially to the
nature of the results unless they themselves are part of the experimental
variable under study.

When Eaa are to be made concerning the relative Eae 
oF swo matado, such as two a aod af elect 


abilities, and experiences of the subjects ha 3 
ing the results. It is then necessary to follow a manipulatory procedure
that will distribute these effects equally to all phases of the
investigation. Some of the better-known procedures for achieving this distribution
will now be considered.
Procedures for Equating Interest, Attitude, Ability, and Past Experience.
The Procedure of Matched Groups. In research with human subjects
the procedure involving matched or equated groups is very frequently
used. When we are interested in discovering the effects of a
controlled variation in a specific factor to which we expose our
subjects, we want to be assured that the results obtained can be
unquestionably attributed to the variation that we experimentally introduce.
To achieve this we utilize two groups of individuals in one of which we
introduce the variation and in the other of which no variation occurs.
The group in which the variation is introduced is called the experimental
group, while the other is called the control group.

Suppose we are interested in knowing if 2 years of college training is
conducive to faster learning in aircraft pilot training. We would use two
groups matched on such factors as age, intelligence, mechanical ability,
etc., one of which would have 2 years of college training (the
experimental group), the other of which would be high school graduates with
no college training (the control group). It is obvious that any
differences found between the two groups in their rate of learning to be
pilots cannot be attributed to their differences in respect to the
experimental variable of college training unless we make sure that the two
groups are alike with respect to other relevant factors that
might contribute to these differences (age, intelligence, mechanical
ability, etc.).

The fundamental purpose in forming matched groups is to gain greater
control over the relevant factors that might enter the experiment to
produce a specific bias in the results or to increase the sampling error. We
want to obtain two groups that are homogeneous with respect to all
variables that might produce differences in behavior that will affect in
any way the difference to be expected from the experimental variable.
It is apparent that if two groups are homogeneous in respect to relevant 
characteristics at the beginning of an experiment and a difference be- 
tween them occurs after the introduction of an experimental variation, 
then the difference must be ascribed to this variation. In using the method 
of matched groups the problem is to determine the particular relevant 
factors that, when used as a basis for equating the individuals, will pro- 
duce the greatest homogeneity with respect to the unwanted relevant 
factors that might contribute a systematic effect. 

To equate two groups in such factors as interest, attitude, ability, and
past experience, special procedures are used in assembling the two
groups. One procedure is to randomly assign the subjects to the control
and experimental groups. The assumption is made that through a random
assignment of the subjects chances are about equal that any relevant
factor will affect the two groups to about the same extent. A
second procedure requires the forming of pairs of individuals who are
about equal in terms of experience, ability, etc. Then by random
assignment, an individual of each pair is placed in each of the two groups.

A special form of the matched-group procedure is that of consanguine- 
ous pairing, in which siblings, fraternal twins, or identical twins become 


the pairs, each pair then being divided between the control and experi- 
mental groups by some random method of assignment. 
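
The second procedure, pairing followed by random assignment within each pair, can be sketched as follows. The sketch assumes that the subjects are matched on a single score. The scores, the subject labels, and the function name are hypothetical, and matching on several factors at once would require a more elaborate rule than simple ranking.

```python
import random


def assign_matched_pairs(score_by_subject, seed=None):
    """Pair subjects adjacent in rank on the matching score, then place one
    member of each pair in each group by chance.

    Returns a list of (experimental_member, control_member) tuples.
    """
    if len(score_by_subject) % 2 != 0:
        raise ValueError("an even number of subjects is needed for pairing")
    rng = random.Random(seed)
    ranked = sorted(score_by_subject, key=score_by_subject.get)
    assignments = []
    for index in range(0, len(ranked), 2):
        pair = [ranked[index], ranked[index + 1]]
        rng.shuffle(pair)  # chance decides which member goes to which group
        assignments.append((pair[0], pair[1]))
    return assignments


# Hypothetical ability scores used as the basis for matching.
ability_scores = {"S01": 52, "S02": 71, "S03": 49,
                  "S04": 68, "S05": 60, "S06": 63}
pairs = assign_matched_pairs(ability_scores, seed=7)
experimental_group = [experimental for experimental, _ in pairs]
control_group = [control for _, control in pairs]
```

Ranking and pairing neighbors keeps the members of each pair close on the matching score, while the shuffle within the pair prevents the experimenter's preferences from deciding which member receives the experimental variation.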


The Procedure of Self-control. Another procedure for manipulating
these factors is that of self-control, in which the behavior of the same
individuals is measured under all experimental conditions, and any differences
in behavior are ascribed to the differences among the experimental
conditions. The effect of relevant disturbing factors in the form of interest,
attitude, ability, and experience is largely equated by using the subject
in every condition. Other disturbing factors may arise, however, such as
those of fatigue and practice effect, and these must be held constant or
eliminated before the findings can be attributed to the differences
introduced in the experimental conditions.

The Procedure of Successive Practice. This is an adaptation of the
procedure of self-control. The subject engages in the several conditions of
the experiment, repeating them several times in different orders so as to
equalize the effects of fatigue, practice, changes in incentive, and the
like which arise during the experiment. The factors of interest, attitude,
ability, and past experience are presumed to be held constant through
self-control. It will be noted that the procedure discussed in Chap. 12,
of counterbalancing experimental conditions to minimize factors arising
in the order of these conditions, exploits the method of successive practice
in distributing equally to all conditions any variation arising from factors
not part of the experimental variable. Predisposing interest, attitude,
ability, and experience factors are held constant through the procedure
of self-control.
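
One simple way of arranging the conditions in different orders is to use every possible order equally often. The sketch below is an illustration under that assumption and is not the only form counterbalancing can take. The condition labels, subject labels, and function names are hypothetical.

```python
from itertools import permutations


def counterbalanced_orders(conditions):
    """Every possible order of the conditions, each appearing once."""
    return [list(order) for order in permutations(conditions)]


def assign_orders(subjects, conditions):
    """Give the subjects the orders in rotation, so that the orders are used
    equally often when the number of subjects is a multiple of the number
    of orders."""
    orders = counterbalanced_orders(conditions)
    return {
        subject: orders[index % len(orders)]
        for index, subject in enumerate(subjects)
    }


orders = counterbalanced_orders(["A", "B", "C"])
schedule = assign_orders([f"S{n:02d}" for n in range(1, 13)], ["A", "B", "C"])
```

With three conditions there are six orders, and each condition stands first, second, and third equally often. Effects that depend only on position in the series, such as fatigue or practice, are thereby spread equally over the conditions. With many conditions the number of possible orders grows rapidly, and a selected subset of orders would have to be used instead.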
The procedure of successive practice is used in experiments on trans- 
fer of training. Here the problem is to find whether training on task A 


influences the performance on task B. If the transfer and control groups 
are matched, then the transfer group performs task A followed by task B, 
whereas the control group performs only task B. The procedure of self- 
control is functioning in the transfer group. In some transfer experi- 
ments it may be impossible to form matched groups, in which case it is 
necessary to devise and utilize equated tasks. The procedure of self- 
control must then be used, the subjects performing task A and then task 
B. It should be apparent that under these conditions the two tasks of 
A and B must be equated, otherwise there is no way of determining 
whether any obtained difference is a transfer effect or is due to a varia- 


tion in the tasks. 
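
When matched transfer and control groups are used, the transfer effect can be expressed as the difference between the two groups in their performance on task B. The sketch below assumes that performance is scored as the number of trials needed to learn task B, so that a lower score means faster learning. The scores and the function name are hypothetical.

```python
from statistics import mean


def transfer_effect(transfer_group_task_b, control_group_task_b):
    """Mean task-B score of the transfer group minus that of the control
    group. The sign must be read in light of how task B is scored."""
    return mean(transfer_group_task_b) - mean(control_group_task_b)


# Hypothetical trials needed to learn task B by each subject.
trials_after_task_a = [12, 14, 11, 13]   # transfer group: task A, then task B
trials_without_task_a = [18, 17, 19, 18]  # control group: task B only

difference_in_trials = transfer_effect(trials_after_task_a, trials_without_task_a)
```

Here a negative difference means that the transfer group needed fewer trials, which would be read as a favorable effect of the prior training on task A. The difference can be attributed to that training only to the extent that the groups were in fact matched.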
Andrews (ed.), “Methods of Psychology,” chap. 15, John Wiley & Sons,
Inc., 1948. A brief review is given of some of the methods used in the
tionships between behavior and different structural 
nervous processes and other bodily functions. 


tudying the development of 
the nervous system, behavioral changes correlated with delimitation of 


the nervous system brought about by pathology or special operative tech- 
niques, changes incident to stimulation of selected parts of the nervous 
system, electropotential changes detected along nervous pathways during 


various forms of response, and behavioral changes correlated with various 
conditions in the internal environment of the body. 


CHAPTER 14 


Some Procedures for Quantifying Behavior 


The most profitable form of description involves the measurement of the 
phenomenon under study. Measurement requires us to apply some mean- 
ing of numbers to the phenomenon. In order to measure we must dis- 
cover a characteristic of the phenomenon that exists in more than one 
value or amount, that is, a characteristic that has magnitude, and then 
we must express the amount of the characteristic in some acceptable unit
of measurement. If the characteristic is amenable to the number mean- 
ing that we wish to apply, then by expressing the characteristic in quan- 
titative terms we achieve a form of description of the phenomenon upon 
which wide agreement can be obtained. 

Before discussing specific quantitative procedures, we should have in
mind what is meant by the phrase behavior change. Interpreted broadly,
we mean any change we can observe in the individual. There is then a
very wide variety of responses for which we shall need quantitative
procedures. We shall need to measure objectively expressed behavior,
that is, all kinds of overt responses, and we shall need to measure
subjectively expressed behavior, that is, all kinds of experience. We shall
need to subdivide the overt responses into many classes, which extend
along a continuum of complexity from the simple reflexes at one end to
the very complex behaviors observed in the interpersonal relationships
of social adjustment at the other end. We shall also need to subdivide the
subjective reactions into many classes, which extend from the sensory
experiences of the more simple type to the complex mental processes
involved in the highest forms of reasoning.
At the outset we should understand that the ease of measurement
is directly conditioned by the particular behavior that we wish
to quantify. In contrasting objective and subjective behavior, we shall
find the objective more amenable to measurement. In contrasting motor
and mental behavior, we shall find the motor more amenable to
measurement. In contrasting reflexive and interpersonal behavior, we shall
find the reflexive more amenable to measurement. In contrasting the
measurement of single isolated characteristics with the measurement of
integrated molar forms of behavior, we shall find the former more amenable
to measurement.

It is not our intention in this chapter to describe a particular measure- 
ment procedure for every one of these several kinds of behavior change. 
Rather, the principal types of measurement will be discussed within 
which various adaptations can be found that are applicable to the many 


kinds of behavior change that will need to be measured. 


SOURCES OF ERROR IN QUANTIFYING PSYCHOLOGICAL 
VARIABLES 


We can better understand measurement procedures if we are aware of 
some of the ways through which error may be introduced. Several times in 
previous chapters we have had occasion to indicate how mistakes may 
occur in the application of the scientific method. When committed, many 
of these mistakes will be reflected as errors in the quantitative description 
of the variables. 

The Definition and Delimitation of the Variables as a Source of Error.
We should carefully define and delimit the behavior to be studied before
selecting the appropriate measurement procedures. It might be said that
we cannot measure any behavior that we do not understand. This
statement is not entirely correct. One purpose of quantifying a variable is to
gain a better understanding of it. Measurement is a step on the way to
understanding. It is necessary to try our hand at measuring a poorly
defined variable as a means of gaining an improved definition of it. Right
from the beginning, however, the behavior to be quantified should be
described as accurately as possible.


Inadequate definition of the problem means errors in determining the


Having decided what kind of be- 
st then determine what specific re- 
this behavior and carefully define 
l ugh these individual responses that 
we measure the itions involved in testing the theorems of the 


a he descriptions so cl 
any qualified observer can understand them 


knowledge of the specific resp i ari be 
left to the observer to decide duri ene, Hite ae 
urement. In the use of attitude scales, the specific attitude to be measured 
must be selected and defined by the experimenter. Accuracy of measure- 
ment is dependent upon having a single attitude represented on the con- 
tinuum. All statements on the scale must revolve around one and only 
one attitude. If more than one attitude is represented on the continuum, 
then the particular response called out from a given subject by the scale 
will be a function of which attitude is activated at the time the subject 
makes his decision. For example, in reference to such an issue as prohibi- 
tion there may be several attitudes. An individual may interpret the 
statements in terms of the issue of individual freedom and be unfavorable 
toward prohibition because it is restrictive of the rights of the individual. 
Or he may interpret the statements in terms of public morals and be 
favorable toward prohibition because he feels it would curb drunkenness. 
Or he may interpret the scaled items in terms of the prescriptions of his 
religion and favor prohibition because it agrees with his religious creed. 
These three reactions reflect three different attitudes, not one. Which one 
of the three will be reflected in the subject’s responses to a set of scaled 
statements can be controlled to a large extent by adequately defining and 
delimiting the attitude variable. Further control is achieved in a careful 
selection of the statements. This selection should be based on quantitative 
procedures for determining the validity and consistency of each item and 
not upon subjective judgments made by the investigator about the validi- 
ties and consistencies of the items. 

The Registration of the Behavior Change as a Source of Error. As we 
learned earlier, registration refers to the procedure of making a record 
of what happens in an experiment. A broad ... permanent by registration 
are manipulable in the logical and statistical ... results. Registration, 
then, directly shapes the particular quantitative description ... from 
any study. 

Registration is associated with the standardization of the ... 
conditions so that a representative sampling of the behavior of each 
subject is obtained. It aids in minimizing variations in experimental pro- 
cedures which might result in differences in exposure to crucial experi- 
mental stimuli. It thus determines how accurately the behavior of each 
subject is represented in the quantitative analysis. 

In addition to recording the functioning of systematic variables, regis- 
tration includes the recording of changes resulting from unsystematic 
variables. It thus is part of the procedure by which chance factors are 
brought under control for purposes of evaluation. It is the basis on which 
the chance factors are separated from the experimental factors, a step 
that must be taken before any quantitative evaluation can be made of the 
theorem being tested. 

With this broader conception of registration it should be apparent 
that errors of registration can directly result in errors of measurement. 
Failure to register accurately the quantitative variation in either the ex- 
perimental stimuli or the consequent responses makes possible errors 
in the measurement of the functional relationships under study. Failure 
to record variations in factors not part of the experimental variables re- 


MEASUREMENT BY FREQUENCY OF OCCURRENCE 


Then every occurrence of lateness satisfying this definition is considered 
to contribute equally to the judgment about the worker's effectiveness. 
If the length of time the worker is late is made part of the definition, 
then the occasions of lateness are classified according to amount, using 
some such categories as the following: 0-5 minutes, 6-10 minutes, over 
10 minutes. Here an amount characteristic is being used in association 
with the frequency of occurrence of the late behavior. 
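
The classification just described can be sketched in a few lines of Python. This is only an illustration, not part of the original procedure: the function names and the sample lateness values are hypothetical, and lateness is assumed to be recorded in whole minutes.

```python
from collections import Counter

# Amount categories taken from the text; the labels are used as dictionary keys.
CATEGORIES = ("0-5 minutes", "6-10 minutes", "over 10 minutes")


def classify_lateness(minutes_late):
    """Assign one occasion of lateness to its amount category.

    Assumes lateness is recorded in whole minutes (an assumption of this sketch).
    """
    if minutes_late < 0:
        raise ValueError("lateness cannot be negative")
    if minutes_late <= 5:
        return CATEGORIES[0]
    if minutes_late <= 10:
        return CATEGORIES[1]
    return CATEGORIES[2]


def tally_lateness(occasions):
    """Count the frequency of occurrence of lateness within each category."""
    counts = Counter(classify_lateness(m) for m in occasions)
    # Report every category, including those that never occurred.
    return {category: counts.get(category, 0) for category in CATEGORIES}


# Hypothetical record of six late arrivals for one worker, in minutes.
record = [3, 7, 12, 5, 10, 25]
print(tally_lateness(record))
```

The count within each category is still a frequency of occurrence; the categories merely attach an amount characteristic to what is counted.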

The Importance of Accurately Defining the Variable. This point needs 
little further amplification. It should be obvious that if the unit of be- 
havior is the unit of measurement, there can be accurate measurement 
only when there is accurate definition of the unit of behavior. Accurate 
observation of the frequency of occurrence of the behavior is impossible 
when the behavior to be recorded is ambiguously defined. If we are to 
count the manifestations of a given unit of behavior we must know ex- 
actly what it is that we are observing and counting. 

The Widespread Use of the Method. The method of measurement by 
frequency of occurrence is useful in measuring all forms of behavior 
change. Its application in measuring subjective experience is seen in the 
determination of sensory thresholds. The frequencies of occurrence of the 
judgments “sensation present” and “sensation absent” in association with 
variations in the intensity of the physical stimulus give the necessary data 
for the computation of the absolute intensity thresholds of sensation. The 
use of the method in the measurement of higher mental processes is seen 
in the counting of the number of right answers in tests of intelligence or 
the number of correct solutions of problems in reasoning. The use of the 
method in studying motor coordination is seen in the measurement of the 
time involved in the performance of a reaction, or the number of correct 
responses performed in a given time period. 
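
As an illustration of the threshold computation mentioned above, the following Python sketch turns frequencies of "sensation present" and "sensation absent" judgments into an estimate of the absolute threshold. The 50 per cent criterion and the linear interpolation between adjacent intensities are assumptions of this sketch, chosen because they are common conventions; the text does not specify a computing procedure. The function name and the data are hypothetical.

```python
def absolute_threshold(intensities, present_counts, absent_counts, criterion=0.5):
    """Estimate the intensity at which 'sensation present' reaches the criterion.

    intensities     -- stimulus intensities in ascending order
    present_counts  -- number of 'sensation present' judgments at each intensity
    absent_counts   -- number of 'sensation absent' judgments at each intensity
    criterion       -- proportion of 'present' judgments that defines the threshold
                       (0.5 is an assumed convention, not taken from the text)

    Returns None when the proportions never cross the criterion.
    """
    proportions = [
        present / (present + absent)
        for present, absent in zip(present_counts, absent_counts)
    ]
    for i in range(1, len(intensities)):
        low_p, high_p = proportions[i - 1], proportions[i]
        if low_p < criterion <= high_p:
            low_x, high_x = intensities[i - 1], intensities[i]
            # Linear interpolation between the two bracketing intensities.
            return low_x + (criterion - low_p) * (high_x - low_x) / (high_p - low_p)
    return None


# Hypothetical data: ten judgments at each of four intensities.
threshold = absolute_threshold(
    intensities=[1, 2, 3, 4],
    present_counts=[1, 3, 7, 9],
    absent_counts=[9, 7, 3, 1],
)
print(round(threshold, 2))
```

With these invented data the proportion of "present" judgments passes one-half between the second and third intensities, so the estimate falls midway between them.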

The method of frequency of occurrence is basic to most other measure- 
ment procedures, as will be made apparent in the following sections. 


MEASUREMENT BY MEANS OF HIGHLY STRUCTURED TESTS 


In previous discussions we have described the procedures involved in 
the testing of theorems and have referred to the situations created for this 
purpose as tests. In the present section we shall use the word test in the 
sense of the problematic situations devised for the purpose of measuring 
ability. This use arose in connection with the attempts of the psychologist 
to measure intelligence, and is now applied to the evaluation of all kinds 
of ability. 

The word structure refers to the degree to which the responses of the 
subject are determined and channeled by the nature of the stimulating 
situation presented to him. In highly structured situations the responses 
of the subject are forced to follow certain patterns and therefore to take 
on the particular characteristics required by the test situation. When 
relatively unstructured situations are used the subject is allowed great 
freedom in his response, and therefore the behavior he manifests is less 
determined by the nature of the external stimuli and more determined by 
the particular organization of his personality. 

Tests as Measures of Psychological Predispositions. In most interpre- 
tations of test theory the objective of testing is to measure fundamental 
predispositions to respond, both those that are little affected by past ex- 
perience and those that have been greatly influenced and shaped by 
experience. The responses of an individual in any test situation can be 
referred to the following general factors: the current sensory stimulation, 
both external and internal, including symbolic stimuli; the potentials for 
response and the predispositions to respond that are attributable to 
natural endowment; and the variations and changes in natural endow- 
ment effected through the past experiences of the individual. 

It should be obvious that in the testing situation the investigator is 
limited to manipulating certain segments of the current sensory stimula- 
tion. Through these manipulations he endeavors to introduce control 
over the overt and subjective responses of the subject. Most of the internal 
sensory stimuli in the form of intraorganic processes are beyond the 
experimenter’s direct manipulation. 

What a subject does in a test situation is supposed to be a direct 
function of the predispositions to respond that fall in the specific narrow 
areas represented in the test items. Actually, the responses of the subject 
are also a function of three other factors, namely, predispositions other 
than those being measured, internal stimuli of intraorganic origin, and 
objective sensory stimuli not directly associated with the test items. Each 
of these three factors is a potential determiner of behavior in any test 
situation, and when any one of them is known to have significantly con- 
tributed to the subject’s performance it is usually impossible to deter- 
mine the exact meaning of the test results. 


Areas of Measurement by Tests. The scientific ... been developed are 
here briefly described. 

Ability. Ability refers to the power of the individual to respond. It is 
used to include all other more restrictive terms that refer to the 
individual’s response power. 


Proficiency. Proficiency stands for what the individual can do at any 
moment, that is, without further preparation. No reference is made to 
whether the response is determined by inherited endowments or by 
experience. In proficiency we are interested in finding what the 
individual can manifest at the moment. 

Potentiality. This ability refers to reactions that the individual can learn 
to make if given the experience or training necessary but that at the 
moment he cannot perform. It signifies future promise, and is primarily, 
but not exclusively, conditioned upon inherited endowments. 

Capacity. Capacity refers to the highest level of ability to be expected from an 
individual after ample training and experience. It also is used in the 
sense of promise. 

Aptitude. Aptitude is a broader term than potentiality. In addition to 
potential ability, it includes factors of proficiency, personality, tempera- 
ment, and interests. It is the meaning usually applied when we speak of 
the promise of an individual to succeed in some global stimulus-response 
situation like a vocation. 

Personality. Personality is the sum total of an individual's characteristics 
that summarizes his unique adjustments to his environment. It includes 
character, temperament, and, at times, attitude and interest. Behavior 
involved in interpersonal relationships with other members of society is 
an important aspect of personality. Sometimes personality is conceived 
as a collection of interrelated traits; at other times as a unitary whole with 
many divergent modes of expression. 

Attitude. Attitude is defined as the way a person looks at some aspect 
of the world around him. It is an organization of all the perceptual, emo- 
tional, motivational, and cognitive processes resulting from his experience 
with that aspect and with phenomena related to it. For example, the atti- 
tude of an individual toward war is an organization of the processes 
resulting from his past experiences in which he was exposed to 
situations related to war. 

Interest. Organizations of motives that predispose an individual to pay 
attention to certain features of his environment rather than other features 
are called interests. They involve inherited ... as well as varied past 
experiences associated with the particular environmental features. 

The above are general categories of response. Tests are directed toward 
the measurement of rather specific types of responses within the broad 
categories considered above. An example of the more restricted areas of 
testing developed in the field of ability is found in the subcategories of 
intellectual tests developed in the ... Psychology Program during 
the ... Each of the subcategories, of course, can be further subdivided 
into less inclusive areas. The ... are ... ability, mathematics, reasoning, 
visualization, mechanical ..., perception, perceptual speed, form ..., 
... estimation, spatial ability, orientation, set, ... 

The Manipulation of Physical Stimuli as Test Objects. In some 
situations manipulation of physical stimuli plays the ... eliciting the 
desired psychological behavior. ... sensory response the quality of the 
stimulus usually automatically determines the sense modality of the 
reaction, and other characteristics of the response are associated with 
such features of the stimulus as its intensity, duration, location, 
distance, speed, repetitive nature, etc. 

Manipulation of the physical aspects of the stimulating situation is 
necessary in the measurement of muscular activities, such as those in- 
volved in tests of motor speed, muscular coordination, and muscular 
strength. The timing of the response is in part a function of the speed of 
presentation of the stimulus. This is particularly true in pursuit tasks in 
which radical changes in the rate of the response can be produced by 
variations in the rate of movement of the stimulus target. The difficulty 
of the task in tests of muscular response is conditioned by the speed of 
movement required of the subject. Tasks requiring excessive speed may 
eventually produce incoordination, fatigue, anxiety, and emotional upset, 
a spread of change in response far beyond the rate and accuracy of the 
response that are actually under study. 


anipulation of the stimulus is accomplished 
from simple sensory tests in which a single 
g merely a verbal report from the subject 


n the following tests, manipulation of the 
very important: two-hand coordination, 


ness, self-paced path tracing, bimanual 
it, blindfold bimanual coordination, visu 


confusion response, foresight 


tential meanings, The par- 
the subject will determine 
‘pressed and measured. It is 
understandable to the sub- 


Be is not measuring the particu- 
lar ability for which it is intended, 
Level of difficulty in a Psychological abili 
nat i i 
degrees of abstractness into the test items. A level of abstractness char- 
acterizing relationships among higher order conceptual meanings can be 
found that will challenge even the most brilliant minds. 

The factor of difficulty requires careful study in the preparation of 
any symbolic test of ability because of the probability that it might 
elicit interfering emotional reactions. When the difficulty extends much 
beyond the ability of the subject, anxiety, worry, discouragement, and the 
like may result. In this instance the subject’s responses are being partly 
determined by emotional predispositions rather than being solely deter- 
mined by the ability predispositions which the test was designed to 
measure. 

Literally dozens of different kinds of symbolic verbal test items have 
been devised for eliciting expressions of predispositions. It is possible 
to mention only a few of them. The following ones should be sufficiently 
familiar to the reader to need no further explanation: mathematical prob- 
lems, reasoning problems, word-meaning puzzles, synonyms-antonyms, 
analogies, proverbs, substitution problems, object identification, speed of 
perceptual motion, problems of recall and recognition, problems of set 
and attention, and problems of information. 

Some types of items require the use of pictorial presentation. Photo- 
graphic reproductions of objects, people, machinery, landscapes, etc., 
may be used. Maps and charts are other media that have proved extremely 
successful. In order to elicit the particular type of discriminatory re- 
sponse that is to be evaluated, the pictorial representation is accompanied 
by verbal test items which are used to focus the attention of the subject 
on the particular concepts that he is to manipulate. 

Characteristics of Response Requiring Measurement. As previously 
noted, the nature of the subject’s response is a function of the stimuli 
presented, including the kind of materials or apparatus that he must 
manipulate. In some test situations the primary purpose is to measure 
the overt response for its own sake, and in others the overt response is 
used as an indication of the effectiveness of some ... judgment that is 
reflected in the response. For example, in testing steadiness, we are 
interested in observing the actual amount of movement in the hand 
itself, whereas, in testing the accuracy of response in solving reasoning 
problems, the overt behavior is used as a basis for making inferences 
about the mental processes called reasoning. 

... require the quantitative description of overt response. A few examples 
will make this clear. In the simple reaction-time test, the speed of 
movement is primary, and direction and extent are unimportant. In the 
simple discrimination reaction-time test, speed and direction of movement 
are important and extent is unimportant. In certain more complex 
discrimination tests, all three characteristics of response are important. 
In pursuit responses, the rate and direction of movement are of primary 
concern, extent being of minor importance. In the steadiness test, 
avoidance of movement or the maintenance of a constant position is the 
important function. 

In most tests of intellectual ability, interests, attitudes, and personality, 
the overt responses are reduced to very simple forms so the subject can 
concentrate on evolving answers to the problems presented to him. It is 
unnecessary to review the characteristics of predispositional responses in 
these areas that need description, as they have been referred to in previ- 
ous discussions. Suffice it to say that the following are important char- 
acteristics: the nature or kind of response, the rate of response, the 
accuracy of response, and the level of difficulty at which the subject can 
respond satisfactorily. 

The Recording of the Subject’s Response. The recording of the sub- 
ject’s reactions is not a major problem in pencil-and-paper types of tests. 
The response usually consists of making some simple mark, such as a 
number or check, in an appropriate space near the test item. 

In apparatus tests, registration sometimes may become a very difficult 
problem to solve. The nature of the recording procedure depends upon 
the type of response to be registered. Photographic and polygraphic tech- 
niques give the best continuous recording of movements. In the poly- 
graphic method, one or more writing pens register on a continuously 
moving paper tape. One pen is connected with a timer and registers a 
time line in appropriate units such as fifths or halves of a second. Other 
pens can be activated by the subject as he responds to the problem he is 
solving. The experimenter also may use some of the pens to record facts 
about the subject's behavior. The analysis and evaluation of the records 
are a very time-consuming task, particularly if many subjects are tested. 
In a relatively short experimental sitting several yards of tape per sub- 
ject may be accumulated. Variations in the recordings of each pen must 
be measured, which requires much time. 

... a complex response are measured 
separately, while at other times a single score is used to represent the 
... making its contribution to this single score. ... performed by 
different muscle ... execution of a complex movement may ... in 
understanding the effectiveness of response in ... by ineffective 
reactions in some part function, and such ineffectiveness 
may be revealed if the part functions are scored separately. 

The Evaluation of the Subject’s Responses. The meaning of the test 
performance of any individual or group of individuals is determined by 
comparing it with what are called norms of performance. Test norms are 
the cumulative test performances of many individuals with prescribed 
characteristics. The value of the norms lies not simply in the numerical 
scores but also in the particular characteristics of the persons supplying 
the performances. The value of a test is a direct function of the degree 
to which the norm populations contain the characteristics that are mean- 
ingful for the new individuals on whom the test will be used. If we are 
trying to learn about the intelligence of a college freshman and use 
norms of an intelligence test that are based on the performance of high 
school freshmen and sophomores, the meaningful information we can get 
from his test score is extremely limited. It is the obligation of the con- 
structor of a test to collect and supply norms from the performances 
of individuals representative of the groups of persons for which the test 
is constructed. 
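
The dependence of a score's meaning on the norm group can be illustrated with a short Python sketch. The percentile rank used here is one common way of casting a score against norms and is an assumption of this example, as are the function name and both sets of norm scores, which are invented for illustration.

```python
def percentile_rank(score, norm_scores):
    """Percentage of the norm group scoring below the given score.

    Scores equal to the given score are counted as half below, a common
    convention that is assumed here rather than taken from the text.
    """
    if not norm_scores:
        raise ValueError("the norm group must contain at least one score")
    below = sum(1 for s in norm_scores if s < score)
    equal = sum(1 for s in norm_scores if s == score)
    return 100.0 * (below + 0.5 * equal) / len(norm_scores)


# Hypothetical norm groups for the same test.
high_school_norms = [30, 40, 50, 60, 70]
college_freshman_norms = [50, 60, 70, 80, 90]

raw_score = 50
print(percentile_rank(raw_score, high_school_norms))
print(percentile_rank(raw_score, college_freshman_norms))
```

The same raw score stands at the middle of one hypothetical group and near the bottom of the other, which is why norms drawn from the wrong population tell us little about the individual tested.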

Various kinds of scores are available in terms of which the individual's 
performance can be cast. These were discussed earlier in Chap. 10 in con- 
nection with the measurement of level of achievement. 


MEASUREMENT BY MEANS OF INVENTORIES AND 
QUESTIONNAIRES 


Like tests, inventories and questionnaires are a means of presenting 
a series of standardized stimuli to a subject for the purpose of eliciting 
certain kinds of response. The term questionnaire as used here refers to 
the highly standardized instrument that is carefully constructed for meas- 
urement purposes, and not to the more commonly used assemblage of 
questions hurriedly put together merely to collect information. 

Contrasted with tests, which in the main are concerned with abilities, 
inventories and questionnaires are focused on discovering the preferences 
of the individual. In the use of these procedures the emphasis is upon 
attributing the response to some condition within the individual which is 
more or less enduring in nature. This is to say that the responses are 
elicited and studied as indices of some fundamental predisposition within 
the individual which functions as a determiner of his preferences. 

The Logic of Measurement by Means of Inventories and Question- 
naires. Doubt has been expressed that these instruments can be used to 
measure behavior, and so it is appropriate that we here briefly restate 
certain arguments presented earlier. Inventories and questionnaires can 
be used to quantify responses by the procedure of frequency of occur- 
rence. Only in the sense that units of behavior are being counted and 
these counts used to describe and compare differences in the behavior 
represented are we justified in attributing measurement to these two 
procedures. 

Doubt also has been expressed that a predisposition to respond can be 
reflected in the replies made to items of the type used in these procedures. 
Admittedly, there is a distinct difference between a question that presents 
a problem for which the subject must supply a solution (as in ability tests) 
and a question that requires the person to reveal a preference or make a 
statement of fact connected with his past behavior (as in inventories and 
questionnaires). The difference, however, is not one between measure- 
ment and nonmeasurement. In either instance a series of responses is 
elicited from the subject and these responses are counted and evaluated 
as descriptions of certain of his predispositions. The logic underlying the 
two types of measurement is the same. 

Two objectives must be realized in order to accomplish measurement 
by inventories and questionnaires; namely, items must be devised that 
elicit expressions of the given predispositions to be measured, and items 
must be devised that elicit replies that are representative of the individual 
being examined. If these ... 


ent only when it has been empirically demonstrated 
to be so. This principle immediately removes from consideration as meas- 
uring instruments inventories and questionnaires that base their useful- 
ness on the judgment and beliefs of the persons who construct 
them. These latter devices can be used to collect information, but statisti- 
cal treatment of the data for the purpose of quantitatively assessing indi- 
vidual behavior is not justified. 
Empirical validation demand 


vent 
the scores of successful physicians is said to have the interests common 
to the medical profession. Individuals will vary in the degree to which 
their scores agree with those of successful physicians, and this variation 
in frequency of occurrence of similar responses is interpreted as repre- 
senting differing amounts or degrees of interest in medicine. Only in this 
sense are we justified in claiming that we are measuring interests in 
medicine. 
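
The counting involved can be sketched in Python as follows. This is a minimal illustration under stated assumptions: the item names, the keyed answers attributed to successful physicians, and the two subjects' replies are all invented, and real inventories derive their keys empirically from the responses of the criterion group.

```python
def agreement_score(subject_responses, criterion_key):
    """Count the items on which the subject answers as the criterion group does.

    subject_responses -- mapping of item name to the subject's reply
    criterion_key     -- mapping of item name to the reply typical of the
                         criterion group (here, successful physicians)
    Items the subject left unanswered simply do not add to the count.
    """
    return sum(
        1
        for item, keyed_reply in criterion_key.items()
        if subject_responses.get(item) == keyed_reply
    )


# Hypothetical key and replies; none of these items come from a real inventory.
physician_key = {
    "enjoys biology": "yes",
    "enjoys selling": "no",
    "enjoys laboratory work": "yes",
    "accepts night duty": "yes",
}
subject_a = {
    "enjoys biology": "yes",
    "enjoys selling": "no",
    "enjoys laboratory work": "yes",
    "accepts night duty": "no",
}
subject_b = {
    "enjoys biology": "no",
    "enjoys selling": "yes",
    "enjoys laboratory work": "no",
    "accepts night duty": "no",
}

print(agreement_score(subject_a, physician_key))
print(agreement_score(subject_b, physician_key))
```

The difference between the two counts is the kind of variation in frequency of occurrence that is interpreted as a difference in degree of interest.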

The Representativeness of the Subject’s Responses. When it has been 
demonstrated that an inventory or questionnaire is valid, accurate meas- 
urement of a particular person’s predispositions depends upon the degree 
to which his responses truly represent his predispositions. The items must 
elicit responses that reflect his predispositions, and these responses must 
accurately and comprehensively reveal the nature and organization of 
these predispositions. Any purposeful distortion of the responses by the 
subject in order to conceal his predispositions results in invalid measure- 
ment. This way of responding is called faking. 

Faking refers to the attempt on the part of the subject to manifest re- 
sponses that do not accurately reflect his predispositions. We know that 
on many of the occasions in which measures of predispositions are 
needed, the subject is being evaluated in reference to some subsequent 
adjustment situation, such as entrance into a school to which he has ap- 
plied, or being hired for a job that he desires, or selecting a ... 
vocational career, etc. It is quite natural for him to want to give the best 
performance he can. For some subjects this means that the responses should 
be oriented toward making the best impression relative to the subsequent 
adjustment situation rather than toward revealing accurately the predis- 
positions under study. This faking or cheating results in unrepresentative- 
ness in the responses and therefore invalid measurement. 

Although this source of error should be recognized, it should not be 
interpreted as making measurement impossible. We must keep ... 

... item from an inventory on suggestibility. The subject was required to 
select one of the following three possible answers: 


In complete agreement with the experts 
Agree with the experts, but with reservations 
Disagree with the experts 


In adjustment inventories, statements are selected that reflect behavior 
found in major life-adjustment situations, such as the home, the school, 
the play yard, etc. The statements are so framed as to represent different 
kinds of adjustive responses, and the subject selects the responses that 
characterize his own behavior. Following are some sample items: 


Do you sleep well at night? 
Are you annoyed when people push ahead of you in a line? 


Would you rather read a book or go to a baseball game? 


Items are sometimes formed of pairs of statements, the subject being 
required to select one statement of each pair. This type of item is used 
in some vocational-interest inventories. A number of vocational pursuits 
are arranged in pairs, each vocation being paired with every other one. 
The subject is required to indicate in each pair the vocation he would 
prefer to follow. His vocational interests can then be described by deter- 
mining the particular vocations that have the highest preference ratings. 

The Evaluation of the Subject’s Responses. A subject’s responses to an 
inventory or questionnaire are quantitatively expressed by using the 
method of frequency of occurrence. An evaluation of his responses con- 
sists in comparing them with norms collected from populations having 
the particular characteristics that are being described. For example, in 
... the medical profession, the subject’s responses are compared for 
similarity with the responses of successful physicians. 

Both the qualitative ... subject’s score depend upon their relation ... 
answers made to the items by an appropriate ... 
population norm for these items, meaning can be evolved only by impos- 
ing the subjective judgment of the investigator. This, of course, is sub- 
ject to grave unreliability. 
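
For the paired vocational items described earlier in this section, scoring by frequency of occurrence can be sketched in Python. The vocations, the subject's choices, and the function names are hypothetical, and treating the number of times a vocation is chosen as its preference rating is an assumption of this sketch.

```python
from collections import Counter
from itertools import combinations


def all_pairs(vocations):
    """Pair each vocation with every other one, as the item format requires."""
    return list(combinations(vocations, 2))


def preference_counts(vocations, choices):
    """Count how often each vocation is chosen across all pairs.

    choices -- mapping of each pair (as produced by all_pairs) to the
               vocation the subject preferred within that pair
    """
    counts = Counter({vocation: 0 for vocation in vocations})
    for pair in all_pairs(vocations):
        chosen = choices[pair]
        if chosen not in pair:
            raise ValueError(f"{chosen!r} is not a member of the pair {pair!r}")
        counts[chosen] += 1
    return dict(counts)


# Hypothetical vocations and one subject's hypothetical choices.
vocations = ["physician", "engineer", "teacher"]
choices = {
    ("physician", "engineer"): "physician",
    ("physician", "teacher"): "physician",
    ("engineer", "teacher"): "teacher",
}

counts = preference_counts(vocations, choices)
ranked = sorted(counts, key=counts.get, reverse=True)
print(counts)
print(ranked)
```

With n vocations there are n(n - 1)/2 pairs, so the number of judgments required of the subject grows quickly as vocations are added.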


MEASUREMENT BY MEANS OF UNSTRUCTURED 
STIMULUS SITUATIONS 


Measuring instruments in which there is little attempt to set up pre- 
scribed stimulus-response relationships are described as unstructured. 
Actually, all measuring situations are structured to some degree for the 
subject. What is meant by unstructured is really a relatively low degree 
of structuring of the stimulus situation. 

Unstructured testing situations are primarily concerned with discover- 
ing the organization underlying the predispositions that form an indi- 
vidual’s personality. The organization of the personality is presumed to 
be revealed in the observable behavior of the individual. To get this or- 
ganization reflected in behavior it is necessary to use unstructured situa- 
tions. The subject’s behavior will then be determined by the relative 
strengths and interrelationships of his predispositions rather than by the 
structure imposed by the test situation. These unstructured situations are 
commonly called projective tests. 

The Purpose of Using Unstructured Stimulus Situations. The primary 
aim of unstructured situations is to observe personality in action, to pro- 
vide a stimulating situation that will encourage the subject to project 
the thoughts, feelings, motivations, attitudes, etc., that are characteristic 
of his personality. In order to do this the experimenter and the test situa- 
tion should contribute as little as possible to the nature of the specific 
responses. The fewer the restrictions placed upon the responses, the 
greater will be the chances of eliciting behavior that is representative 
of the personality of the subject. Although unstructured test situations 
do not provide opportunity for completely free responses, the stimuli 
they do provide are near-natural in character, and thus the situations tend 
to elicit near-natural types of responses. 

One of the primary objectives of unstructured tests is to reduce the 
subject’s purposeful control of his responses. The tests are planned to 
minimize the possibility of the subject's selecting and manifesting only 
those responses that are of the socially approved type or which in some 
way are satisfactory to him but not representative of his personality 
and inhibiting other forms of response that are not socially approved or 
which at that moment are not satisfying to the subject. What is wanted 
are responses characteristic of the inner dispositions and not just re- 
sponses that the individual is willing to put on display for others to ob- 
serve. What is needed, then, is a form of stimulating situation that will 
elicit a very wide range of responses which will reflect the many aspects 
of the personality. The wider the range of responses, the higher the 
chances of eliciting behavior that is representative of the personality. 

Inasmuch as the subject may be wary of revealing any responses that 
are in any sense unsocial, it is desirable to prevent him from knowing the 
particular purpose of the testing. If his cooperation can be elicited with- 
out his becoming concerned with the nature of the situation to which he 
is asked to respond, purposeful control of his responses will be greatly 
reduced. The more the test situation is structured, the more readily can 
the subject discern the possible uses that might be made of his perform- 
ance, and the greater the likelihood that he will introduce conscious 
control over his responses. 

Sometimes inner motivating conditions are active without the indi- 
vidual’s being conscious of them; that is, there are features of behavior 
that even the individual cannot trace to known purposes or desires on his 
part. It is important that these features of personality be studied, and it is 
doubly difficult to detect and describe them when the subject purposely 
decides on the type of response he is going to make. 

The Structuring of the Situation by the Subject. It should be remem- 
bered that the word structure refers to the meaningfulness of the stimu- 
lating situation that is determined by the organization imposed on the 
stimulus elements. With unstructured stimuli, a minimum of organization 
of the stimulus elements is imposed by the experimenter. Great freedom 
is allowed the subject to structure the situation for himself. The specific 
meanings assigned the stimulus elements are largely contributed by the 
subject in terms of his own perceptual, emotional, rational, attitudinal, 
and other predispositions. 


Kinds of Stimuli. A wide variety of stimuli has been utilized in 
projective tests. [...] the personality are presumably exposed in the 
meaningfulness exhibited in the stimulus-response relationships that the 
subject forms by his replies and by other characteristics of his behavior 
that reveal emotional disturb[...] 

A more complex verbal type [...] story, which is then used as [...] 
consisting of cloudlike masses shaded in different tones of gray, 
nonsymmetrical in shape and indefinite in outline. The subject is shown 
a picture and asked to tell 
what he sees. The meaningless nature of the forms is conducive to a wide 
range of responses. Another unstructured stimulus consists of photographs 
of ambiguous pictures, usually containing human figures. The blurring 
of the outline of the objects represented, the indefiniteness of design, and 
the lack of detail make the pictures rather ambiguous in meaning. The 
subject is asked to tell the story that he thinks a picture depicts. The 
Thematic Apperception Test is a widely used test of this kind. 

Kinds of Responses Observed. There are no restrictions on the kinds of 
responses that the investigator may use in his analysis of the personality. 
Overt movements, subtle gestures, inflections of voice, signs of inhibiting 
response, as well as the content of verbal expressions contribute to the 
analysis. 

The uniqueness of a subject’s responses is of vital concern. If the per- 
sonality of a given individual is to be understood, we must learn how 
he differs from other individuals. It is important then to discover the 
unique meanings that a given individual places upon the stimulus-re- 
sponse relationships of the test situation. The investigator must closely 
observe in a subject the interplay among the perceptual, emotional, ra- 
tional, and other forms of response if he is to determine successfully the 
meaning any given response has for the subject at the time it is manifest. 

The Recording of the Subject’s Responses. No attempt is made to 
obtain a record of all the subject’s responses, but permanent registration 
of the more significant phases of his responses is attempted. If the subject 
does not make a record of his performance, as in a written completion of 
a story, the investigator usually takes notes. Forms for recording responses 
by shorthand symbols make possible rather full accounts of the behavior. 
It is necessary that the activity of recording not contribute significantly 
to the structuring of the test situation; therefore, there is little use made of 
any kind of instrumental registration unless it is accomplished without 
disturbance to the subject. 

The Evaluation of the Subject’s Responses. In an unstructured stimulus 
situation, the responses are studied not in their own right as adaptive re- 
sponses to prescribed stimuli, but as indices of the more general person- 
ality organization that presumably pervades all of the individual’s re- 
sponses. The meaning to be assigned any given response is determined 
primarily in terms of its integration with other responses comprising the 
personality, and so this meaning will vary with changes in the specific 
conditions under which the response occurs. It is then possible for the 
same response to have different meanings in different contexts even 
though it is a manifestation of the same personality. 

The responses observed in an unstructured stimulus situation are spe- 
cific in nature; the personality to be described is general in nature. The 
responses are then rationally manipulated in ways that promise to shed 
light on the nature of the inner personality mechanisms. This is accom- 
plished by means of rational constructs derived to a large extent from 
the intuitive judgment of the experimenter. Some of these constructs are 
far removed from empirically observed facts, as, for example, the con- 
cept of reified unconscious. Other constructs hew closer to empirical 
evidence and get their characteristic meanings primarily from observable 
data, as, for example, the concept of conflict. 

For the most part, procedures for scoring and evaluating the subject's 
responses are not highly standardized. For most test situations, the norms 
are not objectively and statistically derived as in tests of ability. Rather, 
they are frequently based upon the subjective values found in the psy- 
chological, logical, and philosophical points of view of the investigator. 
These subjectively derived norms are applied to the responses during the 
rational analysis. Not being objectified, they cannot be readily submitted 
to empirical check by other investigators. Admittedly, a great amount of 
research must still be accomplished on projective-test situations before 
they can be accepted as reliable and valid scientific measuring in- 
struments. 


MEASUREMENT BY MEANS OF RATINGS 


Rating procedures are primarily concerned with obtaining accurate 
quantitative descriptions of global types of behavior. Some notion of the 
widespread use of ratings can be obtained from the following types of 
response to which they have been applied: attitudes, opinions, beliefs, 
preferences, personality traits, character traits, abilities, and interests. In 
some procedures the behavior to be rated is elicited by means of experi- 
mentally controlled stimuli, and in others the behavior results from un- 
controlled naturalistic stimuli. 


Kinds of Measurement in Rating Procedures. Expert Judgment Based 
on Observation of Behavior. In this procedure the quantifying of the be- 
havior is based on the judgment of a so-called expert, e.g., a teacher in a 
schoolroom, an officer in a military unit, a foreman in a working gang. 
More than one judge is usually necessary to get an accurate measure- 
ment. A rating scale may be used. This consists of a series of statements 
or paragraph descriptions describing the characteristics of the responses 
that the judge is to evaluate. The scale may provide a line on which the 
judge indicates the quantitative value of the response, or it may provide 
a series of categories representing different amounts, and the judge 
chooses that category that best describes the subject being rated. 

In this procedure the judge making the rating must have had the 
opportunity of observing the specific responses to be evaluated as they 
were performed by the particular individual being assessed. This re- 
quirement can be met in two ways. In one procedure, acquaintance judges 
are used; that is, persons do the rating who through their past associations 
with the subject have observed him behave in situations in which the 
responses being measured would normally be expressed. In the other 
procedure, an experiment is devised in which the subjects are placed in a 
stimulating situation that would normally call out the behavior to be 
assessed, and the judge observes a subject respond before making an 
evaluation of him. 
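
The remark that more than one judge is usually necessary can be illustrated with a small sketch. It assumes, purely for illustration, that each judge has checked one of five ordered categories for each subject and that the category numbers may be averaged; the function name, the judges, the workers, and the ratings are all hypothetical and are not taken from any actual rating form.

```python
from statistics import mean


def pooled_ratings(ratings_by_judge):
    """Average each subject's ratings over all the judges who rated him."""
    collected = {}
    for judge_ratings in ratings_by_judge.values():
        for subject, rating in judge_ratings.items():
            collected.setdefault(subject, []).append(rating)
    return {subject: mean(values) for subject, values in collected.items()}


# Hypothetical data: three judges rate three workers on a 1-to-5 scale.
ratings = {
    "judge_a": {"worker_1": 4, "worker_2": 2, "worker_3": 5},
    "judge_b": {"worker_1": 5, "worker_2": 3, "worker_3": 4},
    "judge_c": {"worker_1": 3, "worker_2": 1, "worker_3": 3},
}

pooled = pooled_ratings(ratings)
for subject, value in sorted(pooled.items()):
    print(subject, value)
```

Averaging is only one way of combining judges, and it presupposes that the categories can be treated as equally spaced amounts; where that assumption is doubtful, some other summary, such as the middle judge's rating, may be preferred.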


Self-appraisal. In this procedure the subject rates or judges himself. 
With adults, some form of scale is used to represent or portray the re- 
sponse to be evaluated, and the subject checks that point on the scale 
that he thinks best represents his own behavior. Self-appraisal may be 
used with children by presenting behavior situations in the form of 
descriptive statements in which some personal characteristic is being 
varied and asking the child to select the statement that best describes 
his own way of doing things. Self-appraisal procedures have significant 
application when the child's notions about himself are being studied. 

Many attitude scales utilize the self-appraisal method. The individual 
examines a list of scaled statements and selects those that he considers 
represent his own thinking and convictions on the issue involved. 

Comparison of Products with Scaled Samples. This procedure contains 
two parts, namely, a sample response or product made by the subject 
who is to be evaluated, and a set of samples of the same type of response, 
which have been scaled on a particular continuum in terms of their qual- 
ity, excellence, or some other similar meaning. The task of the rater is not 
one of estimating the quality of the subject's sample but rather of deter- 
mining that specimen on the scale that is most nearly like the sample 
product of the subject. The score given to the subject's response is the 
score of the scaled specimen that the judge selects. 

Two Types of Stimuli in Rating Procedures. In using rating procedures 
it is necessary to distinguish two types of stimuli, namely, those that 
elicit the behavior in the subject who is to be rated, and those influencing 
the judge at the time he is making his evaluation. When an experimental 
situation is created to elicit the behavior to be rated, both types of stimuli 
are present in the measurement situation. Of particular importance is the 
fact that the judge is in the situation and gets firsthand and immediate 
observational data for his rating. When a judge is asked to rate a person 
from his past acquaintance with him, the stimuli on the rating scale or 
rating form are used to aid the judge in understanding the behavior to be 
evaluated so he will be more likely to recall from his past experience 
those particular responses of the subject that are pertinent. Of course, 
the stimuli that led the subject to make the particular responses recalled 
by the judge are usually unknown and indeterminable. 

In measurement by use of ratings, the purpose of the general instruc- 
tions of the rating procedure and of the specific items on any scale that 
is used is to acquaint the judge with the characteristics of behavior that 
are to be rated. It is intended that these verbal stimuli will elicit recall and 
evaluative reactions in the judge and will direct and restrict these recall 
and evaluative reactions to the particular behavior to be measured. These 
stimuli then contribute significantly to the determination of the particular 
responses that are rated, even though they are not the stimuli that elicited 
responses in the subject himself. 

The two types of stimuli are found in attitude measurement. The area 
of attitude being measured is delineated by general descriptions of the 
scale and by the individual statements comprising the scale. These stimuli, 
however, do not determine the attitude of the subject; they merely func- 
tion to elicit reactions of recall, reasoning, comparison, etc., by which the 
subject determines those items that depict an attitude in agreement with 
his own. 

[...] then be observed many times before any final judgment is formulated. 
Such instrumental registration, however, can be used in only a small pro- 
portion of the total number of situations for which the rating method 
of measurement is applicable. 

The Accuracy of Measurement by Ratings. The accuracy of measure- 
ment by ratings depends directly upon getting the judge, or the subject 
if he is functioning as a judge, to do a conscientious job of evaluation. 
It is possible to get a measure that is completely fictitious. For example, 
in attitude measurement not every individual who is measured on a par- 
ticular attitude scale has crystallized his attitude on the issues being repre- 
sented. The question arises as to what is measured when an individual 
responds to an attitude scale when he really has not formed an attitude 
about the issue. Similarly, in the use of ratings by expert judges, a rater 
may not have an adequate understanding of the responses being assessed, 
or if he has an adequate understanding he may not have had an oppor- 
tunity to observe the individual being rated in the appropriate situations 
reflecting these responses. Yet in either case he will follow the prescribed 
rating procedures and evolve an evaluation of the subject. Again, the 
question arises as to what has been measured in such an instance. 


THE INTERVIEW AS A DATA-COLLECTING PROCEDURE 


Like the other procedures thus far discussed, the interview is a means 
of collecting data about a subject’s behavior. The term interview stands 
for a generic concept which includes a variety of procedures used in col- 
lecting data through a person-to-person contact between an interviewer 
and a respondent. An interview may be conducted in a casual manner 
and the respondent may then not even be aware that he is being inter- 
viewed, or it may be highly formalized, requiring special physical facilities 
and adhering to a thoroughly standardized routine. 

The Scientific Status of Interviewing Procedures. Interviewing varies 
widely in respect to its scientific worth. As a method for studying the de- 
terminant relationships of a given behavior phenomenon, it is a difficult 
procedure and subject to many pitfalls. In the hands of an untrained per- 
son an interview is worthless, being reduced to a biased selection of re- 
plies made to a series of questions the stimulating value of which is to a 
large extent unknown and unknowable. 

We are interested in the interview as a scientific instrument. When 
carefully planned in regard to purpose, questions, observations, record- 
ing, and analysis of results, the interview assumes the characteristics 
of a scientific procedure. Unplanned, with variable purposes and ques- 
tioning procedures, with biases of the interviewer entering into the selec- 
tion, recording, and analysis of the material, it may become completely 
unreliable and invalid. It then cannot be dignified with the name of [...] 

Advantages of Interview Procedures. One advantage of the in- 
terview is its flexibility. For certain kinds of fact collecting it is 
desirable to have a procedure that can be varied to meet the demands 
of the moment. In measuring human behavior it is not possible to solve 
all problems by exposing every individual to standardized and [...] 
procedures. Differences are sometimes more readily detected by varying 
the measuring procedure than by keeping it constant. It is true, of course, 
that in the process of varying the procedure the measurement ceases to be 
identical for different individuals. The resulting error may or may not 
be of serious consequence. Comparisons of different persons will be ad- 
versely affected, and when quantitative values are involved such compari- 
sons are not justified. In the evaluation of a given person, the errors may 
be inconsequential. This is true when we need to obtain all of the knowl- 
edge we can about an individual. For example, we may primarily want to 
know about John Smith as an individual, and not about John Smith as a 
member of a group. 

Another significant characteristic of the interview is that it provides 
a situation for gaining knowledge about behavior that is closely associated 
with underlying personality predispositions. As in the case of unstructured 
stimulus situations, this behavior is considered a manifestation of inner 
conditions which can be reached only by inference through rational 
analysis. Any conclusion is then subject to the errors of the intuitive judg- 
ment of the interviewer. 


A further advantage is the possibility afforded in the interview situa- 
tion of gaining ideas about the interrelatedness of responses of different 
kinds. The hum[...] as a unified whole, but the structure[...] 
[...] response question overcomes this limitation in some measure, but in so 
doing it produces an unstructured situation in which the control of the 
stimulus-response relationships passes from the interviewer to the re- 
spondent. The meanings of the responses must then be interpreted in 
terms of their own interrelationships rather than in terms of a known 
objective set of stimuli provided in the form of precise questions. 

The interviewer plays a very significant role in the stimulus situation. 
Even when his primary job is to administer a set of highly standardized 
questions, his manner of executing the procedures can have a definite 
effect upon the nature of the results obtained. His influence on the recall, 
thinking, and verbalization of the respondent can have a determinant 
effect on the particular sequences of behavior that are revealed. It is then 
of prime importance to have well-trained interviewers. 

The Response Situation in Interviewing Procedures. When a highly 
structured interview is conducted, a direct control is presumed to be 
effected over the stimulus-response relationships that are elicited. This is 
true if there is no prefabrication of erroneous responses or if there are no 
defensive replies that distort the responses from those characteristic of 
the respondent. When there is no structuring of the interview, any manip- 
ulation of the respondent's replies is accomplished after the interview 
is completed. By a process of selection those responses pertinent to the 
particular problem being investigated are discovered and subjected to 
special analysis. 

It is often important that a full record be made of what transpires 
in an interview situation. The immediate use made of the collected 
material is not the only use that may be made of it. Individuals other 
than the interviewer may have to evaluate the material, and they can do 
an adequate job only if they have a fairly comprehensive record of the 
interview proceedings. 


The Evaluation of the Respondent's Replies. Highly structured inter- 
views are amenable to quantitative treatment. The method of measure- 
ment by frequency of occurrence is applicable if the conditions of this 
method are met. For example, in consumer-research investigations it is 
possible to design interviews so that statistical methods can be applied to 
measure the effects of different factors on the purchasing and use of a 
given commodity. It is known that economic level, amount of education, 
use, number of persons in the family, and similar factors influence the 
buying of goods. By correctly designing the study and interviewing a 
sample of consumers in whom these several factors are found as they 
occur in the actual market situation, we can determine the relationships 
between the factors and consumer-buying behavior as these relationships 
are revealed in the use of a particular brand of some commodity. 
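
As a rough sketch of how replies from a structured interview lend themselves to measurement by frequency of occurrence, the following tabulates, for each level of one factor, the proportion of respondents who report buying a given brand. The field names, the education levels, and every reply shown are hypothetical and serve only to illustrate the tabulation.

```python
from collections import Counter


def purchase_rates(records, factor):
    """Proportion of respondents at each level of `factor` who buy the brand."""
    totals = Counter()
    buyers = Counter()
    for record in records:
        level = record[factor]
        totals[level] += 1
        if record["buys_brand"]:
            buyers[level] += 1
    return {level: buyers[level] / totals[level] for level in totals}


# Hypothetical replies from eight structured interviews.
interviews = [
    {"education": "grade school", "buys_brand": True},
    {"education": "grade school", "buys_brand": False},
    {"education": "high school", "buys_brand": True},
    {"education": "high school", "buys_brand": True},
    {"education": "college", "buys_brand": False},
    {"education": "college", "buys_brand": True},
    {"education": "college", "buys_brand": False},
    {"education": "college", "buys_brand": False},
]

rates = purchase_rates(interviews, "education")
for level, rate in rates.items():
    print(level, rate)
```

Such proportions describe only the sample interviewed. Whether they reflect the market at large depends on the sample containing the several factors as they occur in the actual market situation, and a sample as small as this one would not support any conclusion.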

In the unstructured interview the evaluation of the respondent's replies 
depends on the subjective judgment of the interviewer. Obviously, ac- 
curate quantitative description of the respondent's behavior is difficult, 
if not impossible, to achieve. 


CHAPTER 15 


Scientific Method in Field-type Studies 


In order to perfect their methods, the earlier scientists went “indoors” 
and set up laboratories where they could create simple experimental 
situations that would yield to the analytical approach. Above all else, 
experimentation demands control of variables, and science was developed 
in a restricted environment in order to achieve this goal. The outcome 
was what we know as modern laboratory science. 

Laboratory experimentation has proved itself a forceful method for 
attacking behavior problems. It has pioneered the way in nearly every 
phase of psychological science. Through it enormous numbers of facts 
have been collected, scores of procedures have been devised, thousands 
of determinant variables have been evaluated, and large numbers of ex- 
planatory concepts and principles have been evolved. 

With the formulation of a large number of dependable procedures the 
scientific psychologist in recent years has broadened his interest to in- 
clude problems that are not amenable to study in the rigorous laboratory 
environment. Laboratory experimentation is still vigorous and fertile in 
achievement, but today, with greater refinement in methods, the psycholo- 
gist is directing his attack upon the more intangible and global complexes 
of behavior that are not readily re-created in a laboratory. 

The present chapter is concerned with nonlaboratory or field-type 
studies. First we shall consider factors that historically have led to the 
application of scientific method to field-type problems. In the remainder 
of the chapter four field-type experiments are described. The first investi- 
gation is developed in a step-by-step progression to illustrate the applica- 
tion of scientific methodology under nonlaboratory conditions. The prob- 
lem dealt with is the evolution and validation of a program for the selec- 
tion and classification of workers. The next two examples illustrate how 
well-designed experiments can be executed under variable conditions in 
the field. The first describes an investigation of two methods used in the 
treatment of schizophrenia. The second is a study of human motivation 
in a factory. The fourth investigation is included to illustrate how in- 
genious procedures can be devised to overcome deficiencies of field 
conditions. A method is described for effecting constant stimulus condi- 
tions in a group experiment involving social interaction. 


SOME HISTORICAL ANTECEDENTS OF FIELD-TYPE STUDIES 


The Problem of Controlling Variables. In the beginning the psycholo- 
gist utilized the physical control procedures of the natural sciences. 
His problems were centered on the analysis of sensation; and the light, 
sound, and other stimuli he used were generated and measured by the 
then known procedures of physical science. Procedures for registering 
responses were required, and he again adapted those that the natural 
scientists had devised. 

In the areas of the more complex mental responses, the psychologist 
seldom could achieve the control he desired simply through the manipula- 
tion of the physical stimulating situation. As we learned in earlier chapters, 
he had to invent means for equating the past experiences and schooling 
of the subjects he used. He had to make sure that his subjects were 
representative of the groups that he wanted to describe. He had to isolate 
for separate study such global factors as home environment, economic 
status, and the like. He then devised procedures for controlling variables 
through the selection of his subjects, his materials, and his data, and by 
adapting to his variables the logic and analysis of mathematical statistics. 

With the growth of the science, psychology developed a large number 
of specialized control procedures which, when they were perfected, 
opened up new vistas for the application of the scientific method. 

The Distortion of Variables under Laboratory Conditions. When the 
psychologist took his problems into the laboratory, he sometimes found 
that the variables he studied were not exactly the same as the variables 
he had intended to study. The constricted and rigid conditions of the ex- 
perimental laboratory had distorted the expression of the variables from 
that which would have been expected to occur under natural conditions. 
[...] The controls [...] laboratory had introduced [...] were forced to 
operate in [...] For example [...] Such rigorous control prevented 
interac[...] that any variable would have manifested. Experimental variables in the 
laboratory were then not always representative of natural experimental 
variables. 

From purely scientific motives the psychologist proceeded to study his 
variables under conditions more nearly like those of natural situations. 
In so doing he modified his familiar laboratory procedures and devised 
many new ones that were more serviceable under the new conditions. 

The Inapplicability of Laboratory Methods to Some Problems. The 
scientific psychologist, in expanding the application of his experimental 
procedures, encountered situations that he could not reproduce in the 
laboratory. For example, the study of newborn children and the study of 
crowds imposed conditions that were impossible to achieve in the labora- 
tory. The psychologist therefore took his laboratory procedures to the 
nursery and to the street. He proceeded to set up field studies. In some 
instances, as in the studies of young infants, he could duplicate his labora- 
tory in nearly all respects; in others, such as in the study of crowds, the 
conditions were so radically different that the procedures developed no 
longer resembled those of the laboratory. 

In the field situation the problem of finding the relative importance 
of the several factors conditioning an event still remained central. The 
psychologist still wanted to separate the more important factors from the 
less important and to discover the functional relationships existing among 
the more important determiners. In many field situations these problems 
proved insoluble by application of available laboratory procedures, but 
eventually statistical methods provided means for their solutions. 

The Development of Interest in Applied Science. Not only did psy- 
chologists want to study behavior under more natural conditions of ex- 
pression, they also desired to study variables that would immediately and 
directly further the understanding of problems in applied science. They 
endeavored to transfer to applied problems the procedures developed in 
laboratory experimentation. After some modification, many of these pro- 
cedures operated successfully, but additional procedures had to be de- 
vised to meet the distinctive features of the applied situations. 

Within many applied areas the use of scientific procedures has resulted 
in marked improvement in the control, prediction, and understanding of 
behavior. Applications in the area of education came early. Many labora- 
tory psychologists were professionally working in this area as teachers 
and took advantage of the facilities that the classroom situation offered. 
The possibility of applying psychological techniques was soon recognized 
in other areas, such as in law, medicine, industry, and vocational 
counseling. 


A comment needs to be made here concerning the application of scien- 
tific method to practical problems of behavior. We are primarily inter- 
ested in the basic research phases of these problems. A distinction can 
be made between the task of discovering the principles of organization 
underlying the predispositions by which individuals achieve satisfactory 
adjustment in any type of behavior situation and the task of discovering 
the best method for applying these principles to the adjustment problems 
of a given individual or group of individuals in a specified type of situa- 
tion. The first task is the problem of the scientist, the second is the prob- 
lem of the practitioner. Obviously, we must understand a principle before 
we can apply it accurately. This understanding is gained from executing 
basic research studies under the conditions that are operative in the 
practical situation. Let us not make the error of thinking that basic re- 
search can be done only in a laboratory, that the basic nature of a study 
is a function of the place in which it is conducted. The purpose of the 
study and the nature of its methodology are the factors by which we de- 
termine whether or not it can be classified as basic research. The examples 
described in the following pages belie [...] can be conducted only [...] 
laboratory. 

[...] to perform effectively on a job. In classification we discover the assign- 
ments of workers to jobs that will achieve the highest over-all utilization 
of the available manpower. These two concepts are closely related and 
thus often confused. Frequently, the emphasis is on worker selection to 
the complete neglect of worker classification. In general, the goal toward 
which society is striving is full employment, and therefore the greater 
emphasis should be placed on worker classification. 

In selection, we are dealing with the problem of finding workers to 
fill a certain job. The emphasis is upon getting the job filled, and our 
concern is with learning if a worker applicant can perform the job. 
Usually there is only one job under consideration but many applicant 
workers to be evaluated. We are then oriented toward the job. We ask 
that the person to be hired be capable of doing the best work. We seldom 
are concerned with whether the worker is better fitted for other jobs 
and whether another job would offer more challenge to his abilities 
and prove more interesting to him than the job for which he is being 
considered. 

In classification, there are several jobs under consideration, as well as 
many workers to be evaluated. There is the problem of matching the 
workers and the jobs. The jobs must be filled by competent workers so 
there will be effective performance. Also, each worker must be placed in 
a job in which his abilities will be challenged and effectively used. The 
problem is not one of finding the particular worker who can best perform 
a particular job; neither is it the problem of finding the particular job that 
a particular worker can best perform. There is a compromise between 
these two approaches. There being several jobs and several workers, no 
one job can be considered to the exclusion of the others, nor can one 
worker be considered to the exclusion of the others. The objective is to 
achieve the best possible job performance and the best possible worker 
adjustment when all jobs and all workers are considered as parts of the 
same problem. 
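
The difference between the two problems can be made concrete with a small sketch. It assumes, for illustration only, that a single predicted-performance score is available for every worker on every job and that over-all utilization may be taken as the sum of these scores; the workers, the jobs, the scores, and both function names are hypothetical. The exhaustive search shown is practical only for a handful of workers and jobs, and it requires at least as many jobs as workers.

```python
from itertools import permutations


def best_selection(scores, job):
    """Selection: pick the one worker with the highest score on one job."""
    return max(scores, key=lambda worker: scores[worker][job])


def best_classification(scores):
    """Classification: try every one-to-one assignment of workers to jobs
    and keep the assignment with the highest total score."""
    workers = list(scores)
    jobs = list(next(iter(scores.values())))
    best_total, best_plan = None, None
    for ordering in permutations(jobs, len(workers)):
        total = sum(scores[w][j] for w, j in zip(workers, ordering))
        if best_total is None or total > best_total:
            best_total, best_plan = total, dict(zip(workers, ordering))
    return best_plan, best_total


# Hypothetical predicted-performance scores (higher is better).
predicted = {
    "adams": {"machinist": 9, "inspector": 8, "clerk": 3},
    "baker": {"machinist": 8, "inspector": 4, "clerk": 2},
    "clark": {"machinist": 5, "inspector": 6, "clerk": 6},
}

print(best_selection(predicted, "machinist"))
plan, total = best_classification(predicted)
print(plan, total)
```

With these made-up scores, selection for the machinist job alone picks Adams, whereas classification places Adams as inspector and Baker as machinist, because that arrangement gives the highest total when all three jobs and all three workers are considered together.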

The Area of Hypotheses in Worker Selection and Classification. The 
problem of matching jobs and workers offers a rich area in which to 
develop hypotheses. These hypotheses revolve primarily around the psy- 
chological predispositions that underlie successful job performance. 

The Definition of a Job. The first problem of definition concerns the 
job itself. It is essential for us to know the requirements of the job: what 
kinds of tasks are performed, what kinds of responses a worker must use 
in order to achieve success, etc. When the job involves skilled responses 
the task of definition is somewhat less difficult than when it is of the 
administrative or executive type involving the use of judgmental re- 
sponses. 


340 Some Individual Scientific Procedures 


The method of analyzing and defining a job is called job analysis.
Several well-developed procedures are available for discovering and
evaluating information about job requirements and job tasks.

In the Aviation Psychology Program the first problem of definition
concerned the kinds of responses required of the pilot, of the bombardier,
and of the navigator. To be a successful pilot a person must know when
the plane is operating correctly, must effectively adjust the controls
by which the plane is maneuvered, must correctly interpret the
information provided him on the control panel, must make judgments
involving the relations of the plane with the crew, weather, and
objective, and must make many other responses too numerous to list
here. Similar behaviors are required of the bombardier and of the
navigator, except that the responses will not be exactly the same as those
of the pilot. There will be some responses common for the three aircrew
positions, but there will be many responses unique for each of the
positions. One of the early tasks of the aviation psychologists was to make
comprehensive job analyses of the three aircrew positions.

The Definition of the Psychological Predispositions. With a knowledge
of what the job requires, our next task is to discover the psychological
predispositions that are involved in performing the job responses. The
job responses are translated into the abilities, interests, attitudes, skills,
etc., of the worker. The procedures of worker analysis are available for
this purpose. The term worker analysis does not refer to the study of any
In the Aviation Psychology Program, the job responses of the three
aircrew positions had to be translated into psychological characteristics 
in the form of interests, abilities, attitudes, and the like. From the knowl- 
edge then available about airplane pilots and navigators it was possible 
to make accurate judgments about many of the predispositions for these 
positions. Less information was available on the characteristics of bom- 
bardiers. Rather than make a piecemeal attack on the worker character- 
istics, however, a systematic approach was evolved by organizing psycho- 
logical research units for broad areas of predispositions and charging each 
unit with making a comprehensive study of the predispositions within the 
area assigned to it. These research units were located at cadet classifica- 
tion centers. Four primary research areas were selected, as follows: (1) 
information, judgment, and intellectual ability, (2) alertness, observation, 
and speed of perception, (3) motor coordination and visual-motor skill, 
and (4) personality, temperament, and interest. Each psychological re- 
search unit concerned itself with developing procedures for identifying 
and evaluating psychological predispositions within its general area which 
were essential to the successful performance of the jobs of pilot, bom- 
bardier, and navigator. 

The Formulation of Hypotheses. Simultaneously with the determination 
of the job requirements and the translation of the job responses into 
worker predispositions, many ideas arise about possible relationships be- 
tween the predispositions and successful performance on the job. Hy- 
potheses then are formulated as functional relationships between the psy- 
chological predispositions of the worker and successful execution of the 
job responses.


From the job analyses and worker analyses of the pilot, navigator, and 
bombardier came ideas about what kinds of motor skills, intellectual 
judgments, personality traits, and the like would be required to success- 
fully pilot an airplane, navigate a course in the sky, or make the neces- 
sary adjustments of a bombsight to effect a direct hit of a target. As the 
aviation psychologist pondered about these interrelationships he de- 
veloped hypotheses for possible investigation. 

The Evolving of Theorems. Identifying Responses That Reflect Predis- 
positions. The functional relationships between psychological predisposi- 
tions and job performance evolved at the hypothetical level must be 
expressed in tangible forms that lend themselves to empirical verification. 
After studying the available facts about the job responses and the worker 
characteristics, we hypothesize that certain predispositions underlie cer- 
tain successful job responses. We must then determine what kinds of be- 
havior can be used to reflect the predispositions we think are essential 
to the job responses. Obviously, we can try out the worker on the job, 
but this is wasteful of time and money. The raison d'être of a selection
and classification program is that it is a faster and cheaper way of
identifying and measuring the predispositions of the successful worker.
In previous chapters several short-cut procedures for learning about
predispositions were described. One of the fastest and most reliable pro- 
cedures is the use of various kinds of ability tests. Another is to get a 
record of the worker’s relevant past experiences. This is often done by the 
interview method. Other short-cut procedures include the use of in- 
ventories and ratings. 
In the selection and classification of pilots, bombardiers, and navi- 
gators, pencil-and-paper tests and apparatus-psychomotor tests were the 
chief means of measuring the selected predispositions. These instruments 
were supplemented by information obtained through ratings and through 
interviewing the cadets. Following are some of the more effective pencil- 
and-paper tests: arithmetic reasoning, dial-and-table reading, spatial 
orientation, biographical data, numerical operations, reading comprehen- 
sion, judgment, general information, instrument comprehension, mechani- 
cal principles, mechanical information, and speed of identification. Some 
of the more effective psychomotor tests were rotary pursuit, two-hand
coordination, complex coordination, rudder control, discrimination-reaction
time, and finger dexterity.
tiveness in job performance is just as important as a measure of the
behavior selected to reflect the predispositions. Without either measure we
cannot empirically examine the theorem. 

In the area of matching workers and jobs, the measure of job success 
by which differences in job performance are detected and evaluated is 
called the criterion. The criterion is usually a reliable and valid practical 
measure of job performance, such as the productiveness of performance 
on the job itself. The criterion should be quantitative in nature and 
easy to obtain. 

Although the most valid measure of the effectiveness of an air-force 
pilot, bombardier, or navigator is how well he performs his job in combat, 
the use of combat criteria for evaluating the selection and classification 
program was not possible because of difficulties of obtaining measures of 
combat performance in time of war. Partial and intermediate criterion 
measures of performance in training schools were the principal ones used. 
Performance in operational training—training just preceding assignment 
to combat duty—was available in the later stages of the war. A simple 
index of effectiveness of performance in navigation training was the 
student's rank in his class at the time of graduation. In pilot training the 
student's success or failure in graduating from primary training, basic 
training, and advanced training was used. In bombardier training the 
student had to complete several aerial missions on which a certain 
number of bombs were dropped. His average circular error was then a 
measure of his performance as a bombardier. In all three positions class 
grades in various required courses also were available as criteria. 
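The bombardier criterion mentioned above lends itself to a short worked example. The sketch below assumes that the circular error of a single bomb is its straight-line distance from the target, and that impact points are recorded as (x, y) offsets in feet; both the definition and the sample coordinates are assumptions made for illustration, since the text does not spell out how the error was computed.

```python
import math


def average_circular_error(impacts, target=(0.0, 0.0)):
    """Mean radial miss distance over a set of bomb impacts.

    impacts: list of (x, y) impact coordinates (hypothetical units: feet).
    target:  (x, y) coordinates of the aiming point.
    """
    if not impacts:
        raise ValueError("at least one impact is required")
    tx, ty = target
    # Radial distance of each impact from the target.
    distances = [math.hypot(x - tx, y - ty) for x, y in impacts]
    return sum(distances) / len(distances)


# Invented impacts that miss by 50, 100, and 150 feet respectively.
drops = [(30.0, 40.0), (-60.0, 80.0), (0.0, -150.0)]
print(average_circular_error(drops))  # 100.0
```

A smaller average indicates better performance, so a criterion of this kind runs in the opposite direction from class grades or class rank expressed as "higher is better".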

The Empirical Testing of the Theorem. The General Nature of the 
Testing Situation. A theorem is confirmed if empirical evidence support- 
ing its consequences can be found. In the problem of matching workers 
and jobs the theorem states that certain test responses as manifestations 
of certain predispositions are valid indices of successful job responses 
and that the consequence of using these test responses in selecting and 
classifying the workers would be an improvement in job performance. 
The empirical testing of the theorem is accomplished when test scores 
measuring the predispositions are collected, the persons are allowed to 
perform on the job, the job-performance measures are obtained, and the 
job-performance measures are checked against the test scores. If there is 
a positive relationship between the two sets of measures, then the test 
scores are an index of job responses. The higher this relationship is, the
higher will be the validity of the index. 
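The check of test scores against job-performance measures can be sketched with a product-moment correlation. The function name `pearson_r` and the five pairs of scores are assumptions made for illustration; the text does not state which correlation formula was applied in any particular case.

```python
import math


def pearson_r(x, y):
    """Product-moment correlation between two equally long score lists."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("need two lists of equal length, at least 2 items")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Sums of cross-products and squares of deviations from the means.
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined when a list has no variation")
    return sxy / math.sqrt(sxx * syy)


# Invented data: test scores and later job-performance ratings of 5 workers.
test_scores = [1, 2, 3, 4, 5]
job_ratings = [2, 1, 4, 3, 5]
print(round(pearson_r(test_scores, job_ratings), 2))  # 0.8
```

A coefficient near zero would mean the test tells us little about later job performance; the closer it lies to 1, the more useful the test is as an index.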

In the Aviation Psychology Program a large number of tests were 
constructed in order to assess the predispositions. Not all of these tests 
were actually used. Similarly, in order to assess the job responses many 
criteria were investigated; only a few of these were found to be reliable. 
The cadets were given the written and apparatus psychological tests and
thorough physical examinations at the classification centers. They were
then sent to various aircrew training schools. At the completion of every
major phase of training, measures of their success in school were formed
into training criteria. These were the job-performance measures. The test 
scores and the job-performance measures were then correlated to deter- 
mine the predictive effectiveness of the various tests. 

The Validation Group. It is important th 
the group on which the empirical testing 
is readily seen when we consider the possibility of immediately using the 
tests for selecting new workers. But it is also important from the point
of view of the basic design of the research. The theorem states a
relationship between two sets of measures. Both of these measures are in part
determined by the characteristics of the group on which they are ob- 
tained. In setting up the hypothesis and deriving the theorem, certain de- 
fined jobs and certain defined workers are studied and evaluated. If an 
adequate test is to be made of the theorem it is necessary that the sub- 
jects be representative of these workers in order that the conditions of 
the theorem can be translated into an empirical testing situation.

We can use two types of groups for validating selection and classifica- 
tion procedures. One is composed of applicants for jobs, the other is com- 
The Measurement of the Functional Relationships. The final check of
the theorem is made by quantifying the degree of relationship existing be- 
tween the measures of the predispositions and the measures of the job 
responses. If both variables are quantitative in nature, this step is greatly 
simplified by using an appropriate statistical equation. When the
validation group is composed of job applicants there is usually a wide range
of variation in both the predisposition scores and the job-performance
scores. The appropriate statistic is then some measure of correlation.

Fig. 12. Per cent of cadets eliminated from primary pilot training, based
on the records of 185,367 cadets.

In the Aviation Psychology Program, coefficients of correlation were 
computed between the criteria obtained from the different phases of 
training and the three stanine scores. Similar coefficients were computed 
between the criterion measures and the scores on each of the tests. Such 
coefficients are known as validity coefficients. Near the end of the war 
the validity coefficients of the stanines for the three aircrew positions 
were for pilot .58, for navigator .61, and for bombardier .38. Graphs were 
also made, diagramming the relationship existing between the rate of 
elimination and the stanine scores. The relationship found between the 
pilot stanine and elimination from primary pilot training is presented in 
Fig. 12. It will be seen that there is a strong inverse relationship between
the stanine score and the proportion of cadets eliminated from further
pilot training: the higher the stanine, the smaller the proportion eliminated.
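The tabulation behind a chart such as Fig. 12 can be sketched as follows. The records below are invented for illustration and are not the Program's data; the function name and the (stanine, eliminated) record layout are likewise assumptions.

```python
from collections import defaultdict


def elimination_rate_by_stanine(records):
    """Per cent eliminated at each stanine level.

    records: iterable of (stanine, eliminated) pairs, where eliminated is
    True if the cadet was dropped from training.
    """
    totals = defaultdict(int)
    eliminated_counts = defaultdict(int)
    for stanine, eliminated in records:
        totals[stanine] += 1
        if eliminated:
            eliminated_counts[stanine] += 1
    # One percentage per stanine level, in ascending stanine order.
    return {s: 100.0 * eliminated_counts[s] / totals[s] for s in sorted(totals)}


# Invented records: four cadets at each of three stanine levels.
records = [
    (1, True), (1, True), (1, True), (1, False),
    (5, True), (5, False), (5, False), (5, False),
    (9, False), (9, False), (9, False), (9, False),
]
print(elimination_rate_by_stanine(records))  # {1: 75.0, 5: 25.0, 9: 0.0}
```

Plotting these percentages against the stanine levels gives the kind of graph the text refers to.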
Both the correlation coefficients and the graph provide evidence that
the selection and classification of aviation pilots achieved remarkable
success. Comparable evidence also was obtained for the positions of
navigator and bombardier.


AN EXPERIMENTAL STUDY OF METHODS OF 
the two experimental groups. An attempt was made to control the
following factors by this procedure:

1. Policy of admission to the hospital 

2. Criteria used in diagnosis 

3. Relative percentages in the experimental groups of different forms 
of the disease 

4. Freedom of the subjects from other kinds of disease 

5. Racial composition of the groups 

6. Age and sex composition of the groups 

7. Economic, occupational, and educational levels of the groups 

The Criterion Used for Evaluating Differences between Treatments. 
The effects of the two treatment methods were measured in terms of the 
rate of remission, that is, the degree of improvement shown at the end of 
a period of time following release from the hospital. Two classifications 
were used for measuring the extent of the improvement. Patients who 
fully recovered or recovered sufficiently to adjust socially at approximately 
their former level were placed in one category. Patients who continued 
to show defects but were able to live in the community at a somewhat 
lower level than previously, and those who showed no improvement, were 
placed in a second category. To determine if the effects of the treatments 
varied with the type of onset of the disease, each treatment group was 
divided into those manifesting acute onset and those manifesting gradual 
onset, and a comparison was made of the rates of remission. A further
division of the two treatment groups was made in terms of the length 
of the illness, and comparisons were made in terms of the remission rate. 
These comparisons were done to check on the statement often made that 
early treatment of schizophrenia with insulin shock produces a higher re- 
mission rate than similar early treatment by more conservative procedures. 

Procedures of the Experiment. Sixty-six patients, in whom typical symp- 
toms of schizophrenia were manifest, comprised the insulin-shock group. 
Each patient was given progressively larger amounts of insulin until the 
dose was sufficient to produce coma within 2 or 3 hours after injection. 
He was allowed to remain in the coma from 1 to 3 hours. During the ex- 
periment each patient was subjected to 30 periods of coma at the rate 
of 6 per week. 

One hundred and thirty-two patients constituted the psychotherapy 
group. Each one was given a psychiatric examination to determine the 
specific problems and conflicts that might have bearing on his maladjust- 
ment. An attempt was made to understand the development of his per- 
sonality in terms of the difficulties he experienced in making social 
adjustments. Whenever possible, individualized social-adjustment pro- 
grams combined with psychotherapy were given to help the patient solve 
his conflicts and develop adequate social responses. Not all patients were
exposed to a full program of reeducation or rehabilitation owing to factors
limiting their stay in the hospital.

Follow-up clinical evaluations of each patient’s condition were made at 
intervals of 6 months for a period of from 1 to 4 years. The clinical ex- 
amination revealed the presence of any psychotic symptoms, the serious- 
ness of these symptoms, signs of further deterioration, the success with 
which the patient was adjusting in his family and community, his cur- 
rent level of adjustment as compared with his former adjustment in the 
hospital and before treatment was begun, and other similar information. 

A Comparison of Insulin and Psychotherapy Treatments in Terms of
Rate of Remission. The findings indicate no significant difference in
rate of remission between the two treatment groups. The per cents
falling in the two categories used for measuring the degree of
improvement, based on the last clinical examination, are given in T
Table 7. Comparative Effects of Treatment of Schizophrenic Patients by
Insulin Shock and Psychotherapy Analyzed in Terms of Type of Onset

Onset      Treatment        Per cent fully or     Per cent not or
                            socially recovered    slightly improved
Acute      Insulin shock            45                   55
           Psychotherapy            45                   55
Gradual    Insulin shock            26                   72 *
           Psychotherapy            29                   67 *

* Per cents do not add to 100 because of deaths.


that resulting from the use of the more conservative treatment of psy- 
chotherapy. 

2. The relative curative effects of insulin shock and psychotherapeutic 
treatments of schizophrenic patients are not related to the type of onset
of the disease. 

3. Early treatment of schizophrenics by insulin shock does not produce 
a higher rate of remission than early treatment by psychotherapy. 
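A comparison of remission rates of the kind summarized in these conclusions can be sketched as a test of the difference between two proportions. The group sizes of 66 and 132 come from the description of the experiment; the numbers of recovered patients used below are invented for illustration, and the text does not say which significance test the investigators actually applied.

```python
import math


def two_proportion_z(success_a, n_a, success_b, n_b):
    """Two-sided z test for the difference between two proportions.

    Returns (z, p_value) using the pooled-proportion standard error.
    """
    p_a = success_a / n_a
    p_b = success_b / n_b
    pooled = (success_a + success_b) / (n_a + n_b)
    se = math.sqrt(pooled * (1 - pooled) * (1 / n_a + 1 / n_b))
    z = (p_a - p_b) / se
    # Two-sided tail probability of the standard normal distribution.
    p_value = math.erfc(abs(z) / math.sqrt(2))
    return z, p_value


# Hypothetical counts: 22 of 66 insulin-shock patients and 48 of 132
# psychotherapy patients classed as fully or socially recovered.
z, p_value = two_proportion_z(22, 66, 48, 132)
print(p_value > 0.05)  # True: no significant difference at the 5 per cent level
```

The same comparison can be repeated within the acute-onset and gradual-onset subgroups, or within subgroups defined by length of illness, to examine the second and third conclusions.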


AN EXPERIMENT ON HUMAN MOTIVATION 


The psychological predispositions underlying human motivation have 
been difficult to subject to experimentation. In this area laboratory control 
procedures modify the natural-type variables that the psychologist really 
wants to study. Motivational processes operating under natural life condi- 
tions have proved difficult to analyze. Despite deficiencies in the control 
of these variables, however, some progress is being made. In the follow- 
ing pages a brief description is given of part of a study conducted on 
motivating factors operating in an industrial situation. 

The Problem. One determinant factor in worker motivation that has 
received delayed recognition but one that probably pervades all types of 
job performances from the least skilled worker to the highest adminis- 
trator, is that of identification with the job and the organization. This 
identification can be developed by the worker in many different ways. 
In recent years it has been shown to be frequently associated with the 
degree to which he participates in the formulation and organization of 
the conditions under which he works. It is difficult to reduce this psycho- 
logical factor to rigorous control, and we cannot expect in an actual 
industrial situation to achieve the degree of control that characterizes a
laboratory experiment. In the present study, however, a very sound ex- 
perimental design was used, and the facts collected add significant mean- 
ings about the motivational processes characteristic of many industrial 
workers. 

Evolving the Hypothesis. The Source of the Problem. In a sewing 
factory employing mostly women workers, it became necessary to trans- 
fer some of the workers from their accustomed jobs to other jobs that 
and to require some of the workers 
cedures. The workers resisted the 
new jobs were not necessarily more 
ew procedures more difficult than 
f the workers dropped off markedly 
entment toward management was 
turnover rose significantly. 
ns usually made, management came 
’ difficulties were probably not due to 
» as these appeared not to make in- 
skills. It was decided that the negative
reactions of the workers were probably traceable to motivational
changes introduced. It was decided that a study should be conducted in
which the workers would be given a better understanding of the reasons 
underlying the changes and be allowed to assist in evolving new jobs and 
new kinds of procedures. Accordingly the following theorem was investi- 
gated: worker participation in evolving the necessary changes in pro- 
cedures and jobs will result in a higher level of motivation and therefore 
no slump in production will occur after the changes are introduced. 

The Testing Situation. Three conditions were organized, representing 
three levels of participation of workers in determining and planning the 
changes to be made in jobs and working procedures. Four groups of 
workers were studied (two groups were assigned to condition three), 
and an attempt was made to match the groups in general level of pro- 
ficiency and other relevant factors. The following experimental conditions 
were set up: 

1. Control Group. Management merely announced the changes to the 
workers in the customary manner. In this condition the worker was
allowed minimum participation. 

2. Group I. Management conferred with the workers some time before 
the changes were made. The changes were explained in a dramatic 
fashion and management tried to get the workers to come to a general 
agreement that changes were necessary. The workers chose representa- 
tives to cooperate with management in planning the needed changes in 
jobs and procedures. 

3. Groups II and III. The procedure was similar to that for group I 
with the exception that all members of the groups participated in design- 
ing the new jobs and procedures. For instance, every worker contributed 
performance in establishing the piece-rate pay for the changed con- 
ditions. 

On both the old and the new jobs the workers were paid by piece rate, 
so objective criteria were provided for evaluating the workers’ perform- 
ance and for serving as a basis for comparing the performances under 
the several conditions. Another available criterion was the rate of labor 
turnover.

The Confirmation of the Hypothesis. The quantitative measurement 
of output for the four groups is presented in Fig. 13, in which production 
before the changes were instituted is compared with production after the 
changes. The control group showed the usual drop in production and 
failed to manifest later recovery. Group I (some participation) dropped 
in performance following the changes, continued for about 10 days at a 
lower level of output, but then improved in effectiveness for the next 20 
days. Groups II and III, in which every worker helped to design the 
changes in jobs and procedures, showed only a slight drop at the time 
the changes were made. After about the third day these groups started to 
increase their production and at the thirty-second day were around 10
units per hour higher than before the changes were instituted.

Other criteria for evaluating the workers' performance showed similar
trends. Worker discontent was far greater in the control group than in the
other groups, and similarly, the control group manifested the greatest
increase in labor turnover after the changes were effected.
cussion, the stimulating conditions are continuously changing and are
markedly inconstant for the various participants. 

In the experiment to be described the stimulating conditions were kept 
relatively constant within a specially designed social-interaction situa- 
tion called the group squares test. The experiment was conducted to 
measure the personality responses of individuals who were placed in a 
conflict situation in which their willingness to cooperate to achieve a 
group goal was placed in competition with their desire to satisfy an im- 
mediate individual goal. This study, for the first time, sets up a situation 
of social interaction in which the stimulating conditions for each indi- 
vidual are identical. 

The primary purpose in describing the experiment is to show how an 
ingenious procedure was devised to effect constancy in the physical and 
symbolic stimuli of a group-interaction situation. We shall therefore 
dispense with a detailed consideration of the hypothesis and theorems 
involved in the study and concern ourselves with the procedural char- 
acteristics of the group squares testing situation. 

The Objectives to Be Achieved in the Experimental Situation. The 
several objectives to be achieved in the experimental situation can be 
stated very briefly as follows: 

1. That the situation be one in which social-interaction behavior be- 
tween several persons is elicited 

2. That there be a common group task in which each subject is invited 
to participate 

3. That each subject be given an identical individual task to perform 
that can be accomplished in consonance with or in opposition to the 
achieving of the group goal 

4. That the problems presented to the subjects, the stimulating mate- 
rials used, and the communication among the subjects be held constant 
for every subject 

5. That the behavior be describable in quantitative terms 

The Nature of the Subjects’ Task. The problem presented to the sub- 
jects was similar to the familiar jigsaw puzzle, in which variously shaped 
pieces of cardboard are fitted together to form a pattern or picture. In
the group squares test the subject formed a 4-inch square from three 
pieces of cardboard that differed in respect to both size and shape. The 
goal of each subject was to complete his square. 

The group task consisted in the subjects’ cooperating with each other 
to the end that all subjects could complete their squares. Each subject 
was to assist the others by giving up pieces he had that might aid another
to complete his square. The group goal was achieved when every sub- 
ject’s square was completed. 
The Procedure for Keeping the Stimulating Situation Constant. The
stimulating situation was made constant for all subjects by the following 
procedure: 

1. Presenting every subject with exactly the same number and same 
shaped pieces of cardboard at the beginning of the test 

2. Providing every subject with pieces of identical shapes for trading 
purposes during the process of completing his square 

3. Requesting every subject to give up pieces of exactly the same
shape after he has completed his square, in order to assist another 
subject 

Constancy of the stimulating conditions was achieved by the experi- 
menter as he dealt with each subject individually. The subjects (in this 
study there were five participants) sat in tablet armchairs formed in a 
large circle, their backs toward the center so they could not observe 
each other at work. The space between chairs was made large enough
to enable the experimenter to move in behind the subjects when he
eginning of the experi- 


piece from a different envelope. Although it appe: 
he was drawing pieces randomly, 


subject fitted together, 
two pieces were placed on the front section of the tray, neither one of
which could be used by the subject. The subject usually made no ex- 
change but did make known his request on the rear section of the tray. 
In the second trial the piece was provided by which the subject could 
complete his square. Through the next four trials two request pieces ap- 
peared on the rear section of the tray, but these pieces were always differ- 
ent from any held by the subjects. These four trials forced upon each sub- 
ject the notion that, although he had completed his square, the other 
subjects still appeared to be working on theirs. 

The seventh trial was the first critical one. The subject was requested 
to give up a certain piece from his completed square. This placed him 
in a conflict situation, his desire to meet the request conflicting with his 
desire to maintain the completed square. This situation presented an 
opportunity for the subject to cooperate with the others or to maintain 
a more individualistic approach of maintaining his own square intact. If 
he did not meet the request on this trial, then the request was repeated 
on each subsequent trial until he answered it. The score given the subject 
was the number of that critical trial on which he yielded the requested 
piece. After a subject had met the request, he was allowed to complete 
his square on the next trial. The experimenter then kept him active in the 
situation until all squares were completed by presenting him with re- 
quests for pieces that he did not have. 
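The scoring rule just described can be put in a few lines. The sketch assumes that a subject's behavior is recorded as one True/False entry per critical trial (True meaning he gave up the requested piece); that record format and both function names are assumptions of the example. The labels "fast", "medium", and "slow" follow the grouping used when the results are reported: first critical trial, second or third, and fourth or later.

```python
def group_squares_score(yielded_on_trial):
    """Number of the critical trial on which the requested piece was yielded.

    yielded_on_trial: list of booleans, one per critical trial in order.
    Returns None if the subject never yielded within the recorded trials.
    """
    for number, yielded in enumerate(yielded_on_trial, start=1):
        if yielded:
            return number
    return None


def speed_group(score):
    """Classify a score into the fast, medium, or slow group."""
    if score is None:
        return None
    if score == 1:
        return "fast"
    if score <= 3:
        return "medium"
    return "slow"


# A subject who refuses twice and gives up the piece on the third request.
record = [False, False, True]
score = group_squares_score(record)
print(score, speed_group(score))  # 3 medium
```

Because every subject receives the same sequence of requests, two subjects with the same score have responded to identical stimulating conditions, which is what makes the scores comparable.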

The Hypothesis and Theorem of the Experiment. The particular experi- 
ment being reported was part of a larger personality assessment program 
in which the definition, analysis, and prediction of effective performance 
were being studied. One of the general hypotheses underlying the study
was that behavior manifested by an individual in a group-interaction 
situation is prognostic of his personality. Personality was more specifically 
defined as effective performance, and the group squares test was devised 
as a particular kind of social-interaction situation. Effective performance 
was further analyzed into three variables, namely, (1) promise of success 
in a professional field, (2) originality as a scholar or scientist, and (3) 
soundness as a person. Ratings on these variables constituted the criteria. 
The theorem tested was that the group squares test scores are predictive 
of the three measures of effective performance. The experimental subjects 
were graduate students in a university. 

The Results of the Experiment. The validity of the group squares test 
can be studied in terms of the mean ratings on the criteria for different 
ranges of scores on the test, and also by correlating the test scores with 
the criteria ratings. These two types of statistical measures are presented 
in Table 8. Those subjects who met the request by breaking their squares
on the first critical trial (seventh) are called the “fast” group, those who
broke their squares on the second and third critical trials the “medium”
group, and those who broke their squares from the fourth to the twentieth
critical trial the “slow” group. From an examination of the mean scores of
these three groups on each of the three criterion measures, it will be seen
that the highest scores were obtained by the medium group. These results
indicate that the most effective performers did not break too quickly or
too slowly, which is a significant finding meriting further investigation.
From the column of coefficients it will be noted that the scores on the
group squares test are positively and significantly related to each of the
criteria, the coefficients for originality and soundness being very
promising.

Table 8. Relation of Group Squares Test Scores to Potential
Success, Originality, and Soundness

Rating                                  Fast     Medium   Slow     Eta *
                                        N = 15   N = 12   N = 10
Potential success                       49.7     55.3     48.0     .28
Originality as a scholar or scientist   51.2     59.1     45.6     .41
Soundness as a person                   16.5     54.8     46.7     .38

* Correlation coefficient for nonlinear relationships.

The results of the experiment clearly point to this form of group-interaction
measure as an acceptable diagnostic and predictive instrument for assessing
personality as it is expressed in terms of the effectiveness of
performance of the individual.
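The coefficient reported in Table 8 is described only as a correlation coefficient for nonlinear relationships. One common coefficient of that kind is the correlation ratio, eta, which compares the variation between group means with the total variation. The sketch below assumes that this is the statistic meant; the ratings are invented for illustration and are not the study's data.

```python
import math


def correlation_ratio(groups):
    """Correlation ratio (eta) for ratings divided into groups.

    groups: list of lists, each holding the criterion ratings of one group.
    eta = sqrt(between-group sum of squares / total sum of squares).
    """
    all_values = [value for group in groups for value in group]
    grand_mean = sum(all_values) / len(all_values)
    ss_total = sum((value - grand_mean) ** 2 for value in all_values)
    if ss_total == 0:
        raise ValueError("eta is undefined when all ratings are identical")
    ss_between = sum(
        len(group) * (sum(group) / len(group) - grand_mean) ** 2
        for group in groups
    )
    return math.sqrt(ss_between / ss_total)


# Invented ratings in which the medium group scores highest.
fast = [48, 52]
medium = [58, 62]
slow = [44, 48]
print(round(correlation_ratio([fast, medium, slow]), 3))  # 0.947
```

Unlike a product-moment coefficient, eta does not require the group means to rise or fall steadily, so it can register a pattern in which the middle group stands highest.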
from a complex, 292
by discovery of representative signs, 
292, 341 
manipulation of, by manipulation of
persons, 294 
measurement of, 293 
through descriptive indices, 293 
through use of measuring devices, 
294, 316 
minimizing of variation in, 306 
Problems, development of conceptual 
framework for, 135, 139 
constituents, determining theoretical 
security of, 137 
discovery of new, 141 
listing of all, 136 


without counter- 
Problems, development of conceptual
framework for, meaning of, 135 
effect of, on method, 6, 133 
factual testing of theoretical frame- 
work of, 143 
constituents, indeterminate, anal- 
ysis of, 145 
discovery of unknown, 145 
increasing theory by, 144 
nature of factual analysis in, 143 
standard situations for, 146 
use of simple situations in begin- 
ning of, 143 
generalization as means of attack on, 
236 
influence of, on direction of research, 
134 
knowledge of, as means for controlling 
bias, 134 
as means for determining method, 
134 
specific, development of, 149 
description in, 150 
evolution of, 149 
interchange between theory and 
fact in, 151 
role of questions in, 149 
setting of limits in, 149 
as stimulus to scientist, 11 
Pseudoscience, schemes of, 4 


Quantification of variables (see Measurement) 
Questioning, in development of specific 
problem, 149 
in generalization, 237 
Questionnaires, 321 


Ratings, measurement by, accuracy of, 
321 
kinds of, 328 
response in, recording of, 320 
types of stimuli in, 329 
Reasoning, concepts in, 31 
errors in, 17, 32 
reduction of, 33 
importance of, in science, 32 
language in, 31 
nature of, 31 
reliability of, 31 
Records, accuracy of, 201 
from apparatus, limitations of, 201 
comprehensiveness of, 200 
need for, 200, 202 
Registration, of observations, 200 
of response, 192 
of subject in psychological tests, 320 
as source of error in quantifying vari- 
ables, 313 
of stimuli, 189 
Relational meanings, from analysis of 
composition of complex variable, 217 
from comparisons of variables, 216 
in complex behavior, 270 
measurement of, 232 
ordered in time, 218 
in simple sensory experience, 268 
(See also Causal relationships; Func- 
tional relationships) 
Reliability, logical aspects of, 207 
of meanings, 207 
of perception, 22 
of reasoning, 31 
of remembering, 26 
statistical, 208 
Remembering, attitude of skeptic to- 
ward, 26 
dependence of, on learning and reten- 
tion, 27 
error in amount of, 28 
error in fidelity of, 29 
error reduction in, 29 
importance of, in science, 27 
reliability of, 26 
Representativeness, replication of obser- 
vations and, 198 
of responses of subject in inventories 
and questionnaires, 323 
Response, categories of, 190 
characteristic of, in psychological tests, 319 
detection of, 191 
evaluation of, in interview, 333 
in inventories and questionnaires, 324 
in psychological tests, 321 
in ratings, 331 
in unstructured stimulus situations, 327 
kinds of, in unstructured stimulus situations, 327 
process or product, 189 
quantification of, 193, 311 
registration of, 192, 313, 320, 327, 330, 333 
verbal report, 192 


Sampling, errors of, control of, 103 
as related to representativeness of behavior, 198 
Sampling, variation in, as source of error 
in generalization, 254 
Science, aims of, control, 40 
prediction, 37 
understanding, 36 
concepts of, 52 
empirical phases of, 41 
as general method, 5 
interpretations of meaning of, 3 
methods of, description, 46 
explanation, 48 
symbolization, 50 
theorizing, 50 
as multiplicity of methods, 5 
questions asked by, 35 
rational phases of, 41 
Scientific method, description, 46 
explanation, 48 
in field-type situations, 335 
modification of, factors underlying, 5 
observation, 193 
presupposition of, 16 
concerning knowing activities, 21 
concerning physical nature, 16 
(See also Postulates) 
steps of, 181 
symbolization, 43 
theorizing, 50 
(See also Method) 
Scientist, approach of, 11 
attitude of, safeguards to, 195 
compared with nonscientist, 11 
concept of, 9 
contribution of, 10 
flexibility of, 12 
as observer, 194 
role of, in science, 9 
tolerance of, for change, 13 
Selection and classification (see Field-type studies) 
Sensory experience, absence of physical 
counterparts in, 269 
simple relationships in, 268 
space-perception situations in, 268 
Standard deviation, characteristics of, 30 
as measure of variation, 230 
Statistical control of variables, in com- 
plex behavior, 86 
conditions essential to, 88 
example, 86 
(See also Control) 
Statistical reliability, 208 
Statistical significance compared with 
practical significance, 199 
Stimuli, categories of, 185 
Stimuli, focal and contextual, 184 
interaction among, 188 
production of, 185 
quantification of, 186 
registration of, 189 
representativeness of, 187 
in tests, physical objects, 317 
symbolic objects, 318 
types of, 323 
in interviews, 332 
in inventories and questionnaires, 
323 
in ratings, 329 
in unstructured situations, 326 
Stimulus comparisons, methods of, aver- 
age error, 273 
constant stimulus differences, 271 
equal-appearing intervals, 274 
minimal changes, 272 
pair comparisons, 274 
nature of problem, 271 
Stimulus equality, as function, of ab- 
stractness of stimulus, 277 
of difficulty of task, 278 
of familiarity, 276 
of motor-skill factors, 279 
nature of problem, 275 
Subject factors in psychological ex- 
periments, anatomical-physiological 
types of, 296 
control of interest, attitude, ability, 
and past experience, 307 
control of irrelevant variables among, 
302 
attitudes, 306 
motivations, 303 
subject adaptability, < 
predispositional, interrelationships, 290 
isolation of, 292 
manipulation of, 294 
measurement of, 293 
predispositions, examples, 289 
as experimental variables, 288, 291 
Symbolization, correspondence of mean- 
ing in, 43 
deficiency of words in, 45 
demands to be met by, 44 
meaning of, 43 
as method of science, 43 
Systematic variables, control of (see Con- 
trol, of experimental variables) 
nature of, 89 
unwanted, 90, 95 


Tabular procedures, in classification, 219 
for describing frequencies, 220 
Tabular procedures, for describing per 
cent frequencies, 221 
use of, 219 
Testing, in prediction, empirical type of, 
38 
rational type of, 38 
of theorems derived from hypotheses, 
167, 343 
unrepresentative, errors due to, 251 
Testing situations, for collecting facts, 
177 
conducting observations in, 193 
interaction among variables in, 188 
responses in, 189 
categories, 190 
detection, 191 
process, 189 
products, 189 
quantification, 193 
registration, 192 
verbal report, 192 
stimuli in, 184 
categories, 185 
focal and contextual, 184 
production, 185 
quantification, 186 
registration, 189 
representativeness, 187 
Tests, psychological, areas of measure- 
ment by, 316 
manipulation in, of physical stimuli, 
317 
of symbolic stimuli, 318 
as measures of psychological predis- 
positions, 316 
response in, evaluation of, 321 
recording of, 320 
response characteristics to be meas- 
ured in, 319 
Theorems, error from tests not represent- 
ative of conditions of, 251 
in experiments, on human motivation 
in industry, 350 
statement of, 350 
test of, 351 
on social interaction, 355 
on worker selection and classifica- 
tion, 341 
evolving of, 341 
stating consequences of, 342 
testing of, 343 
inadequate, error from, 250 
test of, nature of, 168 
procedure for devising, 168 
verification of, 169 
Theoretical framework of problem (see 
Problems) 
Theorizing, in development of problem, 
135, 151 
fallacious examples of, 51 
generalization in, 246 
as method of science, 50 
scientific compared with nonscientific, 
53 
scientific concepts in, 52 
as tool of research, 54 
Theory, generalization as serving pur- 
poses of, 246 
hypothesis and, 161 
scientific law and, 161 


Understanding, as aim of science, 36 
continuum of, 36 
expansion of, 37 
prediction and, 37 
truth and, 36 

Uniformity of nature, definition of, 16 
in functional relationships, 65 

Units of measurement, equality of, 119 
in functionally related variable, 119, 121, 123 
in method of frequency of occurrence, 314 
in psychology, equality of, 124 
kinds of, 122 
standard deviations as, 125 
in “thing itself,” 119, 122 
Unstructured stimulus situations, measurement by, 325 
purpose of using, 325 
responses in, evaluation of, 327 
kinds of, 327 
recording of, 327 
stimuli in, kinds of, 326 
structuring by subject in, 326 
Unsystematic variables, control of, 99 
nature of, 90 
sampling errors and, 103 


Validation, of items in inventories and 
questionnaires. 
of selection and classification program, 
groups used in, 344 
Variability, chance, evaluation of, 209 
measures of, analytical use of, 216 
standard deviation as, 230 
need for evaluating, 206 
of performance, 214 
individual differences and, 215 
intra-individual differences and, 215 
meaning of, 214 
Verbal reporting, 192 
Verbalization of a hypothesis, 155 


Zero points, common, need for, 120 
as “no amount of the thing,” 120 
nonabsolute or relative, 121 
characteristics of, 127 
in psychological measurements, 126 
absolute, 127 
stable, need for, 120 


